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r ^r^n.mn OF THF INVENTION 

Receptors that bind specific molecules (e.g., a hormone, drug, cytok.ne or 
biochemical) have been identified on a mulnn.de of cell type, Receptors are found on 
Teen surf ce or. in the case of soluble receptors, are released into the serum. Effon 

as te direct*, toward iso.ation and charac.en.anon of a number of receptors 
„1 to study their physiological ro.es and ,0 explore possible therapeunc uses. The 
bWing of a parucu.ar target mo.ecu.e by a so.ub.e receptor admin.s.ered to a pauen. 
mav alleviate disorders mediated by the target molecule. 

Cenain receptors have been found ,0 compnse two separate polypepnde chatn 
assorted in the form of a complex. Such two-chain receptors often bmd the targe, 
molecule with greater affinity than that exhibited by one of the chains alone. 

LukerS inhibitory factor (LIF) is a po.ypept.de hormone Una, p.ays a cenual 
role in me regulation of diverse adul. and embryonic sys.ems. LIF acs on a vaneby o 
1 .^s anfhas mu.tip.e bio.og.cal acivities. The diversity in bio.og.ca, aenvuy , 
refiecl in *e various synonyms of LIF, which inc.ude hepa.ocy,e snmu.at.ng facto 
m (Baumann and Wong. J. >m mU nol MJ:1163 11989,); choHnerg.c nenx 
«a,io„ factor (Yamamori e, al.. Science 246: .4,2 [,990,); 
Hpoprotein Hpase inhibi.or (Mori e, a,., Blockm. BiopHy, Re, C«£ 1 a 10 
Wmy human in.erleukin for DA cells (Moreau e, al., Namre 336:690 [19881), 
Z a I facor CTomida e, a,., J. Biol 259:10978 [1984,), differ— 
"ry factor (Abe e, a,., J. Biol CHe m . 264: 894, (.989,); d.fferenuauon 
r„ h hL: acuv.ry (Sm.th and Hooper. Oevel HM . = „d d.fferennauon 
retarding facor (Koopman and Cotton. Exp. Cell. Res. 154:211 1 1984]. 

The c.omng of a leukem.a .nh.bttory fa- receptor (UF-R) has been reponed 

i • rwfin / 702839(1991). This recombinant single-chain Lir-K 
bv Geanne et al. in EMBU J . W.iojy 

plwpcpnde binds LIF. bu, wiU, ^ *» «" ^ U * 
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receptors found on certain normal cells. A receptor that binds LIF with higher affinity 
than that exhibited by this cloned single chain LIF-R would be desirable for certain 
applications. 

Oncostatin M is a secreted single-chain polypeptide cytokine that regulates the 
5 growth of certain tumor-derived and normal cell lines. Oncostatin M is produced by 
activated lymphoid cells. A number of cell types have been found to bind the 
oncostatin M protein. See, for example, Linsley et al., J. Biol. Chem., 264: 4282 
(1989). However, the isolation and characterization of an oncostatin M receptor have 
not been reported. 

10 

SUMMARY OF T HE INVENTION 

The present invention provides a receptor that has the property of binding both 
oncostatin M and leukemia inhibitory factor (LEF). The receptor comprises gpl30 
linked (preferably covalendy) to leukemia inhibitory factor receptor (LIF-R). The 

15 gpl30 polypeptide may be covalently linked to the LEF-R polypeptide by any suitable 
means, such as via a cross-linking reagent or a polypeptide linker. In one embodiment 
of the invention, the receptor is a fusion protein produced by recombinant DNA 
technology. Disorders mediated by either oncostatin M or LIF may be treated by 
administering a therapeutically effective amount of the inventive receptor to a patient 

20 afflicted with such a disorder. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a graph presenting the results of an LIF binding assay. Host cells 
transfected with vectors) encoding gpl30 or LEF-R were assayed for the ability to bind 
25 LIF, as described in example 1 . 

Figure 2 is a graph presenting the results of an oncostatin M binding assay. 
Host cells transfected with vector(s) encoding gpl30 or LIF-R were assayed for the 
ability to bind oncostatin M, as described in example 2. 

Figure 3 is a graph depicting low affinity binding of oncostatin M to host cells 
30 transfected with a gpl30 encoding expression vector, as described in example 2. 

Figure 4 schematically depicts a receptor of the present invention wherein Fc 
polypeptides derived from an antibody are used to link a gpl30 fragment to an LEF-R 
fragment. 

Figure 5 presents composite DNA and encoded amino acid sequences of a full 
35 length LIF-R, determined by comparing the sequences of cDNA and genomic clones. 
The signal peptidase cleavage site is marked with a vertical arrow. The transmembrane 
region is heavily underlined. Potential N-linked glycosylanon sites are marked with 
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asterisks. Hallmark residues associated with the hematopoieun family of receptors are 
shown boxed The horizontal arrow marks the point at which genomic sequence was 
used to derive the 3' coding region of LIF-R, since the cDNA clones employed in 
determining this sequence terminated with a stretch of A nucleotides at this point. 

Figure 6 presents the DNA and deduced amino acid sequences of cloned gpl30 
cDNA as reported by Hibi et al. in Cell 63:\ 149 (1990). A predicted signal sequence is 
underlined. The thick underline indicates a presumed transmembrane region. The sets 
of asterisks identify possible N-glycosylation sites. 

Figure 7 presents Scatchard analyses that demonstrate the interaction of a 
soluble gpl30/Fc fusion protein with soluble LIF-R/Fc in binding LIF and oncostatin 
M, as described in example 7. 



p F TA Tl FD P FSPRTPTION OF THF INVENTION 

15 The present invention provides a receptor comprising gpl30 covalently linked 

to leukemia inhibitory factor receptor (LIF-R). In another embodiment of the 
invention, the receptor comprises gpl30 non-covalently complexed with LIF-R. The 
receptor is capable of binding oncostatin M, and also binds leukemia inhibitory factor 
(LIF). The receptor thus is useful for treating disorders mediated by either oncostatin 

20 MorLIF. 

The gpl30 may be covalently linked to the LIF-R by any suitable means, such 
as via a cross-linking reagent or a polypeptide linker. The gp 130 and LIF-R proteins 
are covalently linked in a manner that does not interfere with the resulting receptor s 
ability to bind oncostatin M and LIF. In one embodiment of the invention, the receptor 
25 is a fusion protein produced by recombinant DNA technology. 

Non-covalent bonding of g P 130 to LIF-R may be achieved by any suitable 
means that does not interfere with the receptor's ability to bind oncostatin M and LIF. 
In one approach, a first compound is attached to LIF-R and a second compound that 
will non-covalently bond to the first compound is attached to gpl30. Examples of 
30 such compounds are biotin and avidin. The receptor is thus formed through the non- 
covalent interactions of biotin with avidin. In one embodiment of the invention, LIF-R 
and gpl30 are recombinant polypeptides, each purified from recombinant cells and then 
non-covalentlv bonded together to form the receptor. A host cell may be transformed 
with two different expression vectors such that both LIF-R and gpl30 are produced by 
35 the recombinant host cell. LIF-R and g P 130 (one or both of which are soluble 
fragments as described below) produced by such transformed host cells may associate 
to form a complex through non-covalent interactions. 
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"Leukemia inhibitory factor receptor" (LIF-R) refers to a protein (a cytokine 
receptor) that is present on the surface of various hematopoietic cells, including 
monocyte-macrophages and megakaryocytes, and on non-hematopoietic cells, 
including osteoblasts, placental trophoblasts, and liver parenchymal cells. LIF-R is 
5 capable of binding leukemia inhibitory factor (LIF) molecules and plays a role in 
transducing the signal provided by LIF to a cell. In the absence of any species 
designation, LIF-R refers generically to mammalian LIF-R, which includes, but is not 
limited to, human, murine, and bovine LIF-R. 

The cloning of human and murine leukemia inhibitory factor receptors (LIF-R), 
10 each a single polypeptide chain, has been reported by Gearing et al. in EMBO J. 
70:2839 (1991), which is hereby incorporated by reference in its entirety. The DNA 
sequence of a human LIF-R cDNA clone and the amino acid sequence encoded thereby 
are shown in SEQ ID NO: 5 and SEQ ID NO: 6. This cloned human cDNA encodes an 
N-terminal fragment of human LIF-R that includes (in order from N-terminus to C- 
15 terminus) a 44-amino acid signal sequence (amino acids -44 to -1), the entire 
extracellular region, a transmembrane region (the first amino acid of which is amino 
acid number 790 of SEQ ID NO: 5) and a portion of the cytoplasmic domain. The C- 
terminus of the fragment includes amino acids encoded by a poly-A segment and by a 
linker employed in vector construction, as described in Gearing et al., supra. The term 
20 "transmembrane region" as used herein refers to a string of hydrophobic amino acids 
positioned between the extracellular domain and the cytoplasmic domain of the protein. 
A plasmid vector containing the above-described cloned human LIF-R cDNA is 
designated pHLIFR-65 and has been deposited in E. coli host cells with the American 
Type Culture Collection on December 11, 1990 (ATCC accession no. 68491). The 
25 DNA and amino acid sequences of a full length native human LIF-R (determined by 
comparing the sequences of cDNA and genomic clones) have been reported by Gearing 
et al. supra and are presented herein in Figure 5. 

The LIF-R encoded by the cloned cDNA (SEQ ID NO: 6) contains the entire 
extracellular region of LIF-R (the domain believed to be responsible for the LIF- 
30 binding activity), and binds LIF, but with lower affinity than does a naturally occurring 
LIF receptor found on certain normal cells. Additionally, oncostatin M competes with 
LIF for binding to naturally occurring high affinity LIF receptors on certain cell types 
(Gearing et al., New Biologist, 4:61, 1992) but did not bind to the above-described 
cloned LIF-R expressed in COS cells. 
35 In order to investigate the possible existance of a high affinity converting 

subunit for the cloned single polypeptide chain LIF-R, host cells were co-transfected 
with the LIF-R encoding plasmid pHLIFR-65 and with pools from a human placental 
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cDNA library (also contained in an expression vector). The co-transfected cells were 
assayed for the ability to bind radiolabeled oncostatin M. 

A positive cDNA pool was subdivided and the procedure repeated to isolate a 
single cDNA clone designated B10G that conferred the ability to bind oncostatin M on 
5 cells co-transfected with B 10G and the LIF-R encoding plasmid pHLIFR-65. The co- 
transfected cells also were found to bind LIF with higher affinity than cells transfected 
with pHLIFR-65 alone. Host cells transfected with B10G alone exhibited low affinity' 
oncostatin M binding sites. The B10G cloned cDNA was sequenced and found to 
encode a protein that is known as gpl30. 

10 Thus, it has now been found that a receptor comprising both LIF-R and gpl30 

binds LIF with higher affinity than does the single-chain LIF-R polypeptide alone. The 
improved LIF binding of LIF-R in combination with gpl30 is described in example 1 
below and depicted in Figure 1 . 

Although LEF does not bind to either high- or low-affinity oncostatin M 

15 receptors, it has now been found that oncostatin M binds to the receptors of the present 
invention comprising LIF-R and gpl30. Oncostatin M binding is described in example 
2 below and depicted in Figure 2. 

A protein known as gpl30 has been purified from cellular sources that include 
placental tissue and a myeloma cell line U266. A number of additional cell types have 

20 been* found to express gpl30 mRNA, as reported by Hibi et aL, in Cell 63:1 149 
(1990). gpl30 has been reported to be involved in the formation of high affinity 
interleukin-6 binding sites and in IL-6 signal transduction (Hibi et al. supra). The 
cloning and expression of cDNA encoding a full length gpl30 protein has been 
reported by Hibi et al., supra, which is hereby incorporated by reference in its entirety. 

25 The DNA and deduced amino acid sequences reported by Hibi et al. for the gpl30 
cloned cDNA are presented herein in figure 6. The gpl30 amino acid sequence may 
vary from that reported by Hibi et al., e.g., leucine may be substituted for valine at 
position 8 in the signal sequence (numbering is as shown in Figure 6). This amino acid 
substitution may be attributable to genetic polymorphism (allelic variation among 

30 individuals producing the protein), and results from the presence of C rather than G at 
nucleotide position 22. 

As used herein, the term LIF-R includes variants and truncated forms of native 
LIF-R proteins that possess the desired LIF-binding or signal transducing activity. 
Likewise, the term gpl30 as used herein includes variants and truncated forms of the 

35 native gpl 30 protein that retain the desired biological activity. For gpl 30, the desired 
biological activity includes binding of oncostatin M; conferring on the inventive 
receptor the ability to bind oncostatin M: and increasing the affinity of the inventive 



WO 93/10151 




PCT/US92/10272 



receptor for LIF, compared to the LIF binding affinity of the single-chain LIF-R 
polypeptide alone. Specifically included are truncated, soluble or fusion forms of LIF- 
R and gpl30, as described below. Variants produced by adding, substituting, or 
deleting amino acid(s) in the native sequence are discussed in more detail below. 
5 One example of an LIF-R polypeptide that may be employed is that encoded by 

the cDNA clone designated pHLIF-R-65 (SEQ ID NO: 5), as described by Gearing et 
al., supra and in example 3 below. Alternatively, a fragment comprising amino acids 1 
to 945 of SEQ ED NO:5 may be employed. Amino acid 945 is the last LIF-R-specific 
amino acid of the polypeptide encoded by clone pHLIF-R-65, before the poly-A 

10 nucleotide segment believed to result from oligo(dT) priming at an internal site in the 
mRNA during preparation of the hLIF-R cDNA. (See Gearing et al., EMBO supra. 
at page 2840, column one.) 

Other examples of LEF-R polypeptides that may be employed in the inventive 
receptors include those lacking all or part of the transmembrane region or the 

15 cytoplasmic domain of the protein. Suitable LIF-R polypeptides thus include those 
containing amino acids 1-x or, when the signal sequence is not desired, amino acids 
45-x of the full length LIF-R sequence depicted in Figure 5, wherein x represents an 
integer from 833 to 1096. Amino acid number 833 is the last amino acid of the 
extracellular domain (i.e., before the start of the transmembrane region.) Polypeptides 

20 terminating in amino acid number 1096 lack the last C-terminal amino acid of the full 
length protein. The desirability of including the signal sequence depends on such 
factors as the position of LIF-R in a fusion protein, as discussed below, and the 
intended host cells when the receptor is to be produced via recombinant DNA 
technology. Note that the numbering of amino acids in Figure 5 (taken from Gearing et 

25 al., supra) differs from that of SEQ ED NO: 5 because the first amino acid of the signal 
sequence is designated amino acid number 1 in Figure 5 but is designated -44 in SEQ 
ID NO: 5. Other polypeptides may be chosen with regard to sequences that are 
conserved in the hematopoietin receptor family, (i.e., chosen to include the boxed 
sequence(s) shown in Figure 5.) 

30 One example of a suitable gpl30 polypeptide is that encoded by cDNA cloned 

into plasmid vector pDC303 to produce a plasmid designated B10G. The source of 
mRNA used in producing the cDNA was human placental tissue. Plasmid B10G in E. 
coli strain DH5a host cells was deposited with the American Type Culture Collection, 
Rockville, Maryland, on November 14, 1991, and assigned ATCC accession number 

35 68827. 

The DNA sequence of the gp 130 cDNA contained in plasmid B10G and the 
amino acid sequence of the gp 130 protein encoded by the cloned cDNA are presented in 
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SEQ ID NO: 1 and SEQ ID NO: 2. The prote.n comprises (in order from .he N- 
erlus to u,e C,erm,nus) a 22-am.no ac.d s.gna, sequence. ^ 
domain (amino acids 1-597). a u— brane reg.on (beg.nn.ng w.ti, am.no 
d a pan.al cyoplasm, doma.n (am.no acids 621-686,. T , 
5 polypeptide defers from .he equivalent pon.o„ of .he Hib. e. al. pro.e.n ,n that he 
amino acid of .he signal sequence is ieucine ra.her man vahne. as drscusseti 

abOVC ' Anomer example of a sunable r : JO polypeptide composes amino acids 1 ,0 
aos „f the SEO ID NO: 1, which includes all of the cysteine residues found .n the 
,0 1c it L doml of .he P ro,e.n, and ,so con.ns a complete Hbronecnn dom,. 
Additional examples of gp.30 polypeptides are those compn,„g ammo acds 1-98 

98-298 of SEQ ID NO: 1 . . . , 

Odrer gp.30 peptides laclang all or pan of the transmembrane reg.cn and/or 

„ c nling amino acids 1-x or, .hen .he signal seouence ,s no, des,^ nu a. 
23-x of me Figure 6 sequence, wherein x represents an .n.eger from 619 ,0 917 ^ The 
"s lino ac d of the rransmembrane region is tine alanine res.due a, posmon 620 ,n 
Fi uxTI Polypeptides .ermina.ing a, am.no acid 917 lack tire la, C-.erm.na ammo 
ITf 1 JTeng* protein presented ,n F.gure 6. Note „a, uhe numbenng «> 
20 acids in figure 6 (ti*en from Hibi e. al.. supra) differs from drat shown „ SEQ ID NO 
td NO^cause tine firs, amino acid of .he s,gual sequence is designate, am.no ac d 
umber I in Figure 6 bu, is designa.ed -22 in SEQ ID NO: 1. Reg-ons of tire ^.30 
protein corresponding .0 doma.ns that are conserved among cenam « are 
discussed by Hibi e. al, supra, a. page 1150. column 2, and page 1151, column L 
25 oler incited gplJO polypeptides chosen to include urese conserve, reg.ons may be 

""^Preferred UF-R and gp!30 polypeptides axe those wh.ch are soluble. In one 

embodiment of tire presen. rnvenuon, ,e recep.or compnses solub.e 

attached » soluble gp>30. "Soluble UF-R" as used in .he context of .he prese 

30 — defers to polypeptides U* are substantially similar .n am.no acs^ence 
all or par. of .he exrracellular region of a native UF-R and .ha., due to the lack of 
llembrane region tha, would cause re.en.ion of .he polypepude on a eel 
melL, are secreted upon cession. Tr,e soluble UF-R poKpepud, . , ™y . 
employed re.ain the abil.ty » b.nd LIF or, by compe.mve.y b.nd.ng UF ,nh,b,< L F 

35 s.gnal rransducion ac.vity v.a cell surface bound L1F-R pro.e.ns. Soluble LLF-R may 
; tlude pa, of ,he ransmembrane reg.on or pan of ,he cy.op.asnuc — 
o,her sequences, prov.ded .hat .he soluh.e UF-R P™* » capabie of be.ng secre.ed 
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Likewise, the term "soluble gpl 30 M as used herein refers to proteins that are 
substantially similar in amino acid sequence to all or part of the extracellular region of a 
native gpl 30 and are secreted upon expression but retain the desired biological activity. 
Soluble gpl 30 may include part of the transmembrane region, cytoplasmic domain, or 
5 other sequences, as long as the polypeptide is secreted. 

Soluble LEF-R and soluble gpl 30 may be identified (and distinguished from 
their non-soluble membrane-bound counterparts) by separating intact cells which 
express the desired protein from the culture medium, e.g., by centrifugation, and 
assaying the medium (supernatant) for the presence of the desired protein. The culture 

10 medium may be assayed using procedures which are similar or identical to those 
described in the examples below. The presence of LIF-R or gpl 30 in the medium 
indicates that the protein was secreted from the cells and thus is a soluble form of the 
desired protein. Soluble LIF-R and soluble gpl 30 may be naturally-occurring forms of 
these proteins. Cloning of a naturally-occurring soluble murine LEF-R is reported in 

15 Gearing et al., supra. Alternatively, soluble fragments of LIF-R and gpl30 proteins 
may be produced by recombinant DNA technology or otherwise isolated, as described 
below. 

The use of soluble forms of LIF-R and gpl 30 is advantageous for certain 
applications. Purification of the proteins from recombinant host cells is facilitated, 

20 since the soluble proteins are secreted from the cells. Further, soluble proteins are 
generally more suitable for intravenous administration and may exert their therapeutic 
effect (binding LEF and oncostatin M) in the bloodstream. 

Soluble LIF-R polypeptides include those comprising the signal sequence and 
entire extracellular domain (amino acids -44 to 789 of SEQ ED NO: 5) or lacking the 

25 signal sequence but containing the entire extracellular domain (amino acids 1 to 789 of 
SEQ ID NO: 5). Soluble gpl 30 polypeptides include those comprising the signal 
sequence and entire extracellular domain (amino acids -22 to 597 of SEQ ID NO: 1 ) or 
lacking the signal sequence but containing the entire extracellular domain (amino acids 1 
to 597 of SEQ ID NO: 1). The preparation and use of these soluble polypeptides in 

30 receptors of the present invention is described in examples 3-5. 

Other soluble LIF-Rs are truncated upstream of the transmembrane region, but 
preferably include that portion of the protein that contains the residues conserved 
among the members of the hematopoietin receptor family (shown boxed in Figure 5), 
i.e., amino acids 1 1-479 of SEQ ID NO:6. The N-terminus of such soluble LEF-Rs is 

35 any of amino acids 1-11 (or -44 when the native signal sequence is included), and the 
protein extends to a C-terminus selected from any of amino acids 479 through 789. 
Two such soluble proteins comprise amino acids -44 - 702, 1 - 702, -44 - 775, or 1 - 
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755 of SEQ ID NO:6. Constructs encoding these proteins may be prepared by 
techniques that involve cleaving the human LIF-R cDNA of the above-described clone 
P HLIFR-65 (ATCC 68491) with the restriction endonucleases Asp718 andXmnl or 
with Asp718 and Bspl286I. Asp 718 cleaves the vector upstream of the inserted LTP- 

5 R-encoding cDNA. Xmnl cleaves within the codon for Asp at position 702 
(generating blunt ends) and Bs P 1286I cleaves just 3' of the codon for Val at postnon 
775 of SEQ ID NO:5. If desired, an oligonucleotide may be ligated to the 3' end of the 
As P 718IBspl286l fragment to extend the LIF-R sequence, e.g., through ammo add 
number 789. An oligonucleotide also may be ligated to the 3' end of a LIF-R fragment 

10 to add the first two amino acids of the Fc polypeptide described in example 5, and a Bgl 
II site useful for attaching the rest of the Fc sequence downstream of the LIF-R 

sequence. fCT:n 
Additional soluble human LIF-Rs comprise amino acids 1-678 or 1-680 of bbq 
ID NO-6 When the human and murine LIF-R amino acid sequences disclosed in 
15 Gearing et al., EMBO J., supra, are aligned (with gaps introduced to maximize identity 
between the two sequences), amino acid 680 of the human sequence is aligned with the 
last amino acid of the munne protein, and amino acid 678 is the last amino acd of the 
human sequence that is identical to a corresponding amino acid in the murine sequence. 
Since the murine protein binds LIF, the murine LIF-R contains that portion of the 
20 protein required for LIF binding. 

An additional example of a soluble gpl30 polypeptide comprises amino acids - 
22 to 582 of SEQ ID NO:2. An expression vector encoding such a protein was 
constructed in example 7. Soluble LIF-R and gpl30 polypeptides also include those 
from which fibroneenn type III (FNTJI) domains have been deleted. From one to all of 
25 the FNII1 domains may be deleted, providing the advantage of reducing the size of the 
protein Preparation of such LIF-R and gpl30 proteins is described in example 8. 

Truncated LIF-R and gpl30, including soluble polypeptides, may be prepared 
by any of a number of conventional techniques. In the case of recombinant proteins, a 
DNA fragment encoding a desired fragment may be subcloned into an expression 
30 vector. Alternatively, a desired DNA sequence may be chemically synthesized using 
known techniques. DNA fragments also may be produced by restriction endonuclease 
digestion of a full length cloned DNA sequence, and isolated by electrophoresis on 
agarose gels. Linkers containing restriction endonuclease cleavage site(s . may be 
emploved to insert the desired DNA fragment into an expression vector, or the fragment 
35 mav be digested at cleavage sues naturally present therein. Alternatively, prote.ns may 
be fragmented using proteolytic enzymes, for example, and the desired truncal 
polypeptide isolated from the digestion mixture using reversed prase HPLC. 
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The well known polymerase chain reaction procedure also may be employed to 
isolate a DNA sequence encoding a desired protein fragment. This technique is 
illustrated in examples 3-5 below. 

In another approach, enzymatic treatment (e.g., using Bal 31 exonuclease) may 
5 be employed to delete terminal nucleotides from a DNA fragment to obtain a fragment 
having a particular desired terminus. Among the commercially available linkers are 
those that can be ligated to the blunt ends produced by Bal 31 digestion, and which 
contain restriction endonuclease cleavage site(s). Alternatively, oligonucleotides that 
reconstruct the N- or C- terminus of a DNA fragment to a desired point may be 
10 synthesized. The oligonucleotide may contain a restriction endonuclease cleavage site 
upstream of the desired coding sequence and position an initiation codon (ATG) at the 
N-terminus of the coding sequence. 

The gpl30 polypeptide is attached to the LEF-R polypeptide through a covalent 
or non-covalent linkage. Covalent attachment is preferred for certain applications, e.g. 
15 in vivo use, in view of the enhanced stability generally conferred by covalent, as 
opposed to non-covalent, bonds. In constructing the receptor of the present invention, 
covalent linkage may be accomplished via cross-linking reagents, polypeptide linkers, 
or any other suitable technique. 



20 known. Heterobifunctional and homobifunctional linkers are available for this purpose 
from Pierce Chemical Company, Rockford, Illinois, for example. Such linkers contain 
two functional groups (e.g., esters and/or maleimides) that will react with certain 
functional groups on amino acid side chains, thus linking one polypeptide to another. 
The reagent and reaction conditions should be chosen such that the cross-linking does 

25 not interfere with binding of oncostatin M and LIF to the receptor. 

One type of polypeptide linker that may be employed in the present invention 
separates gpl30 and LEF-R domains by a distance sufficient to ensure that each domain 
properly folds into the secondary and tertiary structures necessary for the desired 
biological activity. The linker also should allow the extracellular domains of gpl30 and 

30 LIF-R to assume the proper spatial orientation to form the binding site for oncostatin M 
and LIF. Suitable polypeptide linkers preferably (1) will adopt a flexible extended 
conformation, (2) will not exhibit a propensity for developing an ordered secondary 
structure which could interact with the functional gpl30 and LIF-R domains, and (3) 
will have minimal hydrophobic or charged character which could promote interaction 

35 with the functional protein domains. Typical surface amino acids in flexible protein 
regions include Gly, Asn and Ser. Virtually any permutation of amino acid sequences 
containing Gly, Asn and Ser would be expected to satisfy the above criteria for a 
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peptide linker sequence. Other near neutral amino acids, such as Thr and Ala, may also 
be used in the linker sequence. Examples of such polypeptide tinkers are presented 
below. 

Another type of polypeptide linker that may be employed compnses the Fc 
5 region of an antibody. An Fc polypeptide is attached to the C-terminus of LIF-R or of 
the LIF-R fragment. A separate Fc polypeptide is attached to the C-terminus of gpl30 
or of the gpl30 fragment. The two resulting polypeptide chains may be combined in a 
buffered solution, whereupon disulfide bonds form between the two Fc polypeptides 
(c g in the so-called hinge region, where interchain disulfide bonds are normally 
10 present in antibody molecules). Preferably, a host cell is transformed with DNA 
encoding both polypeptides such that the two polypeptides are co-expressed and 
interchain disulfide bonds form in the host cell. LIF-R is thus covalently linked to 
gp!30 via the disulfide bonds in the linker portion of the receptor. Procedures for 
isolating the Fc region of an antibody are well-known and include proteolytic digestion 
1 5 with papain. Alternatively, an Fc polypeptide may be produced by recombinant cells or 
chemically synthesized. Also useful are N-terminal fragments of an antibody Fc region 
that contain the cysteine residues involved in disulfide bond formation at the hinge 
region One example of a receptor containing an Fc polypeptide linker is illustrated in 
example 5 below. The receptor is depicted in Figure 4. The number and posmon of 
20 disulfide bonds may vary from those shown in Figure 4. 

Additional examples of LIF-R/Fc and gpl30/Fc fusion proteins useful in 
preparing receptors of the present invention are described in examples 7 and 8. 
Advantageously, host cells are co-transfected with two different expression vectors, 
one encoding soluble LIF-R/Fc and the other encoding soluble gpl30/Fc. The 
25 heterodimer is believed to form intracellularly or during secretion. 

Homodimers comprising two LIF-R/Fc polypeptides or two g P 130/Fc 
polypeptides linked via disulfide bonds are also produced by certain of the transfected 
host cells disclosed herein. The LIF-R/Fc homodimers are useful for binding LIF and 
the gpl30/Fc homodimers find use in binding oncostatin M. The homodimers may be 
30 separated from each other and from the heterodimer by virtue of differences in size 
(e.g., by gel electrophoresis). The heterodimer also may be purified by sequential 
immunoaffinitv chromatography (described below). 

In an alternative embodiment, a first fusion polypepude comprising gpl30 (or 
fragment thereof) upstream of an antibody light chain (or a fragment thereof) is 
35 prepared A second second fusion polypeptide compnses LIF-R upstream of an 
amibodv heaw chain (or a heavy chain fragment, the N-terminus of which extends at 
least throueh the C H l region. Disulfide bondts. form, between the gpl30-lighi chain 
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fusion polypeptide and the LIF-R-heavy chain fusion polypeptide, thus producing a 
receptor of the present invention comprising a polypeptide linker. If desired, a third 
fusion (an LIF-R- antibody light chain fusion polypeptide) is prepared and combined 
with (disulfide bonded to) a fourth fusion comprising gpl30 fused to an antibody 
5 heavy chain. When the two disulfide bonded molecules are combined, additional 
disulfide bonds form between the two Fc regions. The resulting receptor of the present 
invention comprising the four fusion polypeptides resembles an antibody in structure 
and displays the oncostatin M/LEF binding site bivalently. 

A polypeptide linker may be attached to gpl30 and to LIF-R by any of the 

10 conventional procedures used to attach one polypeptide to another. The cross-linking 
reagents available from Pierce Chemical Company as described above are among those 
that may be employed. Amino acids having side chains reactive with such reagents 
may be included in the polypeptide linker, e.g., at the termini thereof. 

The gpl30 and LIF-R polypeptides may be separately purified from cellular 

15 sources, and then linked together. Alternatively, the receptor of the present invention 
may be produced using recombinant DNA technology. The gpl30 and LIF-R 
polypeptides may be produced separately and purified from transformed host cells for 
subsequent covalent linkage. In one embodiment of the present invention, a host cell is 
transformed/transfected with foreign DNA that encodes gpl30 and LEF-R as separate 

20 polypeptides. The two polypeptides may be encoded by the same expression vector 
with start and stop codons for each of the two genes, or the recombinant cells may be 
co-transfected with two separate expression vectors. In another embodiment, the 
receptor is produced as a fusion protein in recombinant cells. 

In one embodiment of the present invention, the receptor protein is a 

25 recombinant fusion protein of the formula: 



Rl-L-R2or R2-L-R1 

wherein Rj represents gpl30 or a gpl30 fragment; R2 represents LIF-R or an LIF-R 
fragment; and L represents a polypeptide linker. 

30 The fusion proteins of the present invention include constructs in which the C- 

terminal portion of gpl30 is fused to the linker which is fused to the N-terminal portion 
of LIF-R, and also constructs in which the C-terminal portion of LIF-R is fused to the 
linker which is fused to the N-terminal portion of gpl30. gpl30 is covalently linked to 
LIF-R in such a manner as to produce a single protein which retains the desired 

35 biological activities of gpl30 and LIF-R. The components of the fusion protein are 
listed in their order of occurrence (i.e., the N-terminal polypeptide is listed first, 
followed by the linker and then the C-termma] polypeptide). 
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A DNA sequence encoding a fusion protein is constructed using recombinant 
DNA techniques to insert separate DNA fragments encoding gpl30 and LEF-R into an 
appropriate expression vector. The 3' end of a DNA fragment encoding gpl30 is 
ligated (via the linker) to the 5' end of the DNA fragment encoding LEF-R with the 
5 reading frames of the sequences in phase to permit translation of the mRNA into a 
single biologically active fusion protein. Alternatively, the 3' end of a DNA fragment 
encoding LIF-R may be ligated (via the linker) to the 5' end of the DNA fragment 
encoding gpl30, with the reading frames of the sequences in phase to permit translation 
of the mRNA into a single biologically active fusion protein. A DNA sequence 

10 encoding an N-terminal signal sequence may be retained on the DNA sequence 
encoding the N-terminal polypeptide, while stop codons, which would prevent read- 
through to the second (C-terminal) DNA sequence, are eliminated. Conversely, a stop 
codon required to end translation is retained on the second DNA sequence. DNA 
encoding a signal sequence is preferably removed from the DNA sequence encoding the 

1 5 C-terminal polypeptide. 

Suitable polypeptide linkers comprise a chain of amino acids, preferably from 
20 to 100 amino acids in length and most preferably from 30 to 60 amino acids in 
length. As discussed above, the linker advantageously comprises amino acids selected 
from the group consisting of glycine, asparagine, serine, threonine, and alanine. 

20 Examples of suitable polypeptide linkers include, but are not limited to, (Gly4Ser) n , 
wherein n is 4-12, preferably 8, and (Gly4SerGly5Ser)2. 

A DNA sequence encoding a desired polypeptide linker may be inserted 
between, and in the same reading frame as, the DNA sequences encoding gpl30 and 
LIF-R using any suitable conventional technique. For example, a chemically 

25 synthesized oligonucleotide encoding the linker and containing appropriate restriction 
endonuclease cleavage sites may be ligated between the sequences encoding gpl30 and 
LIF-R. 

Alternatively, a chemically synthesized DNA sequence may contain a sequence 
complementary to the 3' terminus (without the stop codon) of either gpl30 or LIF-R, 

30 followed by a linker-encoding sequence which is followed by a sequence 
complementary to the 5' terminus of the other of gpl30 and LEF-R. Oligonucleotide 
directed mutagenesis is then employed to insert the linker-encoding sequence into a 
vector containing a direct fusion of gpl 30 and LEF-R. 

The present invention provides an isolated DNA sequence encoding the above- 

35 described fusion protein comprising gpl30, LIF-R, and a polypeptide linker, and also 
provides recombinant expression vectors containing the isolated DNA sequence. 
"Expression vector" refers to a repiicable DNA construe: used to express DNA which 
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encodes the desired protein and which includes a transcriptional unit comprising an 
assembly of (1) genetic element(s) having a regulatory role in gene expression, for 
example, promoters, operators, or enhancers, operatively linked to (2) a DNA sequence 
encoding a desired protein (in this case, a receptor of the present invention) which is 
5 transcribed into mRNA and translated into protein, and (3) appropriate transcription and 
translation initiation and termination sequences. The choice of promoter and other 
regulatory elements generally varies according to the intended host cell. 

Proteins to be produced in yeast expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by the yeast host cell. 
10 Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an N-terminal methionine residue. This residue optionally 
may be subsequently cleaved from the expressed recombinant protein to provide a final 
product. 

In the expression vectors, regulatory elements controlling transcription or 

15 translation are generally derived from mammalian, microbial, viral or insect genes. The 
ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants may additionally be 
incorporated. Vectors derived from retroviruses also may be employed. 

DNA regions are operably linked when they are functionally related to each 

20 other. For example, DNA encoding a signal peptide (secretory leader) is operably 
linked to DNA for a polypeptide if the polypeptide is expressed as a precursor that is 
secreted through the host cell membrane; a promoter is operably linked to a coding 
sequence if it controls the transcription of the sequence; or a ribosome binding site is 
operably linked to a coding sequence if it is positioned so as to permit translation. 

25 Generally, "operably linked" means contiguous and, in the case of secretory leaders, 
contiguous and in reading frame. 

Transformed host cells are cells which have been transformed or transfected 
with foreign DNA using recombinant DNA techniques. In the context of the present 
invention, the foreign DNA includes a sequence encoding the inventive receptor. Host 

30 cells may be transformed for purposes of cloning or amplifying the foreign DNA, or 
may be transformed with an expression vector for production of the receptor protein. 
Suitable host cells for expression of the receptor include prokaryotes, yeast or higher 
eukaryotic cells under the control of appropriate promoters. Prokaryotes include gram 
negative or gram positive organisms, for example E. coli or bacilli. Prokaryotic 

35 expression vectors generally comprise one or more phenotypic selectable markers, for 
example a gene encoding proteins conferring antibiotic resistance or supplying an 
autotrophic requirement, and an ongin of replication recognized by the host to ensure 
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amplification within the host. Examples of suitable prokaryotic hosts for 
transformation include £. coli, Bacillus subtilis, Salmonella typhimurium, and various 
species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although 
others may also be employed as a matter of choice. Higher eukaryotic cells include 
5 established cell lines of mammalian origin. Cell-free translation systems could also be 
employed to produce fusion protein using RNAs derived from the DNA constructs of 
the present invention. Appropriate cloning and expression vectors for use with 
bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et al. 
(Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985), the relevant 

10 disclosure of which is hereby incorporated by reference. 

Useful expression vectors for bacterial use can comprise a selectable marker and 
bacterial origin of replication derived from commercially available plasmids comprising 
genetic elements of the well-known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 

15 Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. £. coli is typically transformed using derivatives 
of pBR322, a plasmid derived from an E. coli species (Bolivar et al., Gene 2:95, 
1977). pBR322 contains genes for ampicillin and tetracycline resistance and this 

20 provides simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include 
the b-lactamase (penicillinase) and lactose promoter system (Chang et al., Nature 
275:615, 1978; and Goeddel et al., Nature 257:544, 1979), the tryptophan (trp) 
promoter system (Goeddel et al., NucL Acids Res. S:4057, 1980; and EPA 36,776) 

25 and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system 
employs the phage X Pl promoter and cI857ts thermoinducibie repressor. Plasmid 
vectors available from the American Type Culture Collection which incorporate 
derivatives of the X Pl promoter include plasmid pHUB2, resident in E. coli strain 

30 JMB9 (ATCC 37092) and pPLc28, resident in E. coli RR1 (ATCC 53082). 

The recombinant receptor protein may also be expressed in yeast hosts, 
preferably from Saccharomyces species, such as 5. cerevisiae. Yeast of other genera 
such as Pichia or Kluyxeromyces may also be employed. Yeast vectors will generally 
contain an origin of replication from the 2|im yeast plasmid or an autonomously 

35 replicating sequence (ARS), a promoter, DNA encoding the receptor fusion protein, 
sequences for polyadenylation and transcription termination and a selection gene. 
Preferably, yeas: vectors will include an ongin of replication and selectable markers 
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permitting transformation of both yeast and E. coli, e.g., the ampicillin resistance gene 
of E. coli and the S. cerevisiae trpl gene, which provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived 
from a highly expressed yeast gene to induce transcription of a structural sequence 
5 downstream. The presence of the trpl lesion in the yeast host cell genome then 
provides an effective environment for detecting transformation by growth in the 
absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., 7. Biol. Chem. 255:2073, 

10 1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and 
Holland et al., Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase and glucokinase. Suitable vectors and 

15 promoters for use in yeast expression are further described in R. Hitzeman et al., EPA 
73,657. 

Preferred yeast vectors can be assembled using DNA sequences from pBR322 
for selection and replication in E. coli (Amp r gene and origin of replication) and yeast 
DNA sequences including a glucose-repressible ADH2 promoter and a-factor secretion 

20 leader. The ADH2 promoter has been described by Russell et al. (J. Bioi Chem. 
255:2674, 1982) and Beier et al., (Nature 500:724, 1982). The yeast a-factor leader, 
which directs secretion of heterologous proteins, can be inserted between the promoter 
and the structural gene to be expressed. See, e.g., Kurjan et al., Cell 50:922, 1982; 
and Bitter et al., Proc. Natl. Acad. Sci. USA 57:5330, 1984. The leader sequence may 

25 be modified to contain, near its 3' end, one or more useful restriction sites to facilitate 
fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art. 
An exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 
75:1929, (1978), selecting for Trp + transformants in a selective medium consisting of 

30 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 ^lg/ml adenine and 
20 M-g/ml uracil. 

Host strains transformed by vectors comprising the ADH2 promoter may be 
grown for expression in a rich medium consisting of 1% yeast extract, 2% peptone, 
and 1% glucose supplemented with 80 ng/ml adenine and 80 [ig/m\ uracil. 
35 Derepression of the ADH2 promoter occurs upon exhaustion of medium glucose. 
Crude yeast supernatants are harvested by filtration and held at 4*C prior to further 
purification. 
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Various mammalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). 
Examples of suitable mammalian host cell lines include the COS-7 lines of monkey 
5 kidney cells, described by Gluzman (Cell 25:175, 1981), and other cell lines capable of 
expressing an appropriate vector including, for examples, L cells, C127, 3T3, Chinese 
hamster ovary (CHO), HeLa and BHK cell lines. Mammalian expression vectors may 
comprise non -transcribed elements such as an origin of replication, a suitable promoter 
and enhancer linked to the gene to be expressed, and other 5' or 3' flanking 

10 nontranscribed sequences, and 5' or 3' nontranslated sequences, such as necessary 
ribosome binding sites, a poly-adenylation site, splice donor and acceptor sites, and 
transcriptional termination sequences. 

The transcriptional and translational control sequences in expression vectors to 
be used in transforming vertebrate cel!^ may be provided by viral sources. For 

15 example, commonly used promoters \d enhancers are derived from Polyoma, 
Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences 
derived from the SV40 viral genome, for example, SV40 origin, early and late 
promoter, enhancer, splice, and polyadenylation sites may be used to provide the other 
genetic elements required for expression of a heterologous DNA sequence. The early 

20 and late promoters are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin or replication (Tiers et al., 
Nature 273: 113, 1978). Smaller or larger SV40 fragments may also be used, provided 
the approximately 250 bp sequence extending from the Hind III site toward the BgH 
site located in the viral origin of replication is included. Exemplary vectors can be 

25 constructed as disclosed by Okayama and Berg (Mol Cell. Biol 5:280, 1983). 

A useful system for stable high level expression of mammalian receptor cDNAs 
in CI 27 murine mammary epithelial cells can be constructed substantially as described 
by Cosman et al. {Mol Immunol 25:935, 1986). 

Particularly preferred vectors for expression of the inventive receptor as a 

30 fusion protein are described in the examples below. The foregoing discussion is, of 
course, applicable to the production of recombinant fusion proteins comprising a 
fragment of gpl30 and/or a fragment of LIF-R. Suitable fragments are discussed 
above, and DNA sequences encoding such fragments may be inserted into the above- 
described expression vectors. 

35 The present invention provides a process for preparing the recombinani receptor 

of the present invention, comprising cultunng a host cell transformed with an 
expression vector comprising a DNA sequence thai encodes said receptor under 
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conditions that promote expression. The receptor is then purified from culture media or 
cell extracts. 

For example, supernatants from systems which secrete recombinant protein into 
the culture medium can be first concentrated using a commercially available protein 
5 concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit. 
Following the concentration step, the concentrate can be applied to a suitable 
purification matrix. For example, a suitable affinity matrix can comprise LEF or OSM. 
An LIF affinity matrix may be prepared by coupling recombinant human LIF to 
cyanogen bromide-activated Sepharose (Pharmacia) or Hydrazide Affigel (Biorad), 

10 according to manufacturer's recommendations. Sequential immunopurification using 
antibodies bound to a suitable support is preferred. Proteins binding to an antibody 
specific for LEF-R are recovered and contacted with antibody specific for gp 130 on an 
insoluble support. Proteins immunoreactive with both antibodies may thus be 
identified and isolated. Alternatively, an anion exchange resin can be employed, for 

15 example, a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The 
matrices can be acrylamide, agarose, dextran, cellulose or other types commonly 
employed in protein purification. Alternatively, a cation exchange step can be 
employed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopropyl or carboxymethyl groups. Sulfopropyl groups are preferred. One or more 

20 reversed-phase high performance liquid chromatography (RP-HPLC) steps employing 
hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other aliphatic 
groups, can be employed to further purify a fusion protein composition. 

Some or all of the foregoing purification steps, in various combinations, can be 
employed to provide an essentially homogeneous recombinant protein. Recombinant 

25 cell culture enables the production of the fusion protein free of those contaminating 
proteins which may be normally associated with gpl30 or LIF-R as they are found in 
nature in their respective species of origin, e.g., in cells, cell exudates or body fluids. 

The foregoing purification procedures are among those that may be employed to 
purify non-recombinant receptors of the present invention as well. When linking 

30 procedures that may produce homodimers (gpl30-linker-gpl30 and LIF-R-linker-LIF- 
R) are employed, purification procedures that separate the desired heterodimer from 
such homodimers are employed. An example of such a procedure is sequential 
immunopurification as discussed above. 

Recombinant protein produced in bacterial culture is usually isolated by initial 

35 extraction from cell pellets, followed by one or more concentration, salting-out, 
aqueous ion exchange or size exclusion chromatography steps. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification 
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steps. Microbial cells employed in expression of recombinant fusion proteins can 
disrupted by any convenient method, including freeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 

Fermentation of yeast which express fusion proteins as a secreted protein 
5 greatly simplifies purification. Secreted recombinant protein resulting from a large- 
scale fermentation can be purified by methods analogous to those disclosed by Urdal et 
al. (J. Chromatog. 296:171, 1984), involving two sequential, reversed-phase HPLC 
steps for purification of a recombinant protein on a preparative HPLC column. 

The present invention also provides a pharmaceutical composition comprising a 

10 receptor protein of the present invention with a physiologically acceptable carrier or 
diluent. Such carriers and diluents will be nontoxic to recipients at the dosages and 
concentrations employed. Such compositions may, for example, comprise the receptor 
protein in a buffered solution, to which may be added antioxidants such as ascorbic 
acid, low molecular weight (less than about ten residues) polypeptides, proteins, amino 

15 acids, carbohydrates including glucose, sucrose or dextrins, chelating agents such as 
EDTA, glutathione and other stabilizers and excipients. The receptor of the present 
invention may be administered by any suitable method in a manner appropriate to the 
indication, such as intravenous injection, continuous infusion, sustained release from 
implants, etc. 

20 The DNA and/or amino acid sequences of gpl30 and LEF-R may vary from 

those presented in SEQ ED NO: 1 and SEQ ED NO: 5. Due to the known degeneracy of 
the genetic code, there can be considerable variation in nucleotide sequences encoding 
the same amino acid sequence. The DNA sequences capable of hybridizing to the 
native DNA sequence of SEQ ED NO: 1 or SEQ ID NO: 5 under moderately stringent 

25 conditions (50*C, 2 X SSC), and which encode a biologically active gpl30 or LEF-R 
polypeptide, are also considered to be gpl30-encoding or LIF-R-encoding DNA 
sequences, respectively, in the context of the present invention. Further, certain 
mutations in a nucleotide sequence which encodes LIF-R or gpl30 will not be 
expressed in the final protein product. For example, nucleotide substitutions may be 

30 made to enhance expression, primarily to avoid secondary structure loops in the 
transcribed mRNA (see EPA 75,444A, incorporated herein by reference). Other 
alterations of the nucleotide sequence may be made to provide codons that are more 
readily translated by the selected host, e.g., the well-known E. coli preference codons 
for £. coli expression. 

35 The amino acid sequence of native gp 1 30 or LIF-R may be varied by 

substituting, deleting, adding, or inserting one or more amino acids to produce a gp 1 30 

or LIF-R variant. Variants that possess the desired biological activity of the native 
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gpl30 and LIF-R proteins may be employed in the receptor of the present invention. 
Assays by which the biological activity of variant proteins may be analyzed are 
described in the examples below. 

Alterations to the native amino acid sequence may be accomplished by any of a 

5 number of known techniques. For example, mutations can be introduced at particular 
loci by synthesizing oligonucleotides containing a mutant sequence, flanked by 
restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes an analog having the desired 
amino acid insertion, substitution, or deletion. 

10 Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can 

be employed to provide an altered gene having particular codons altered according to 
the substitution, deletion, or insertion required. Exemplary methods of making the 
alterations set forth above are disclosed by Walder et al. (Gene 42:133 , 1986); Bauer et 
al. (Gene 37:73, 1985); Craig (BioTechniques, January 1985, 12-19); Smith et al. 

15 (Genetic Engineering: Principles and Methods, Plenum Press, 1981); U.S. Patent No. 
4,518,584, and U.S. Patent No. 4,737,462, which are incorporated by reference 
herein. 

Bioequivalent variants of LIF-R and gpl30 may be constructed by, for 
example, making various substitutions of amino acid residues or deleting terminal or 

20 internal amino acids not needed for biological activity. In one embodiment of the 
invention, the variant amino acid sequence is at least 80% identical, preferably at least 
90% identical, to the native sequence. Percent similarity may be determined, for 
example, by comparing sequence information using the GAP computer program, 
version 6.0, available from the University of Wisconsin Genetics Computer Group 

25 (UWGCG). The GAP program utilizes the alignment method of Needleman and 
Wunsch (J. MoL Biol. 48:443, 1970), as revised by Smith and Waterman (Adv. Appl. 
Math. 2:482, 1981). Briefly, the GAP program defines similarity as the number of 
aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by the 
total number of symbols in the shorter of the two sequences. The preferred default 

30 parameters for the GAP program include: (1) a unary comparison matrix (containing a 
value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted 
comparison matrix of Gribskov and Burgess, NucL Acids Res. 74:6745, 1986, as 
described by Schwartz and Dayhoff, eds., Atlas of Protein Sequence and Structure, 
National Biomedical Research Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for 

35 each gap and an additional 0.10 penalty for each symbol in each gap: and (3 ) no penalty 
for end gaps. 
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Generally, substitutions should be made conservatively; i.e., the most preferred 
substitute amino acids are those having physiochemical characteristics resembling those 
of the residue to be replaced. Examples of conservative substitutions include 
substitution of one aliphatic residue for another, such as Be, Val, Leu, or Ala for one 
5 another, or substitutions of one polar residue for another, such as between Lys and 
Arg; Glu and Asp; or Gin and Asn. Other such conservative substitutions, for 
example, substitutions of entire regions having similar hydrophobicity characteristics, 
are well known. Moreover, particular amino acid differences between human, murine 
and other mammalian LIF-Rs is suggestive of additional conservative substitutions that 

10 may be made without altering the essential biological characteristics of LIF-R. 

Cysteine residues can be deleted or replaced with other amino acids to prevent 
formation of unnecessary or incorrect intramolecular disulfide bridges upon 
renaturation. Adjacent dibasic amino acid residues may be modified to enhance 
expression in yeast systems in which KEX2 protease activity is present. 

15 EP212,914 discloses the use of site-specific mutagenesis to inactivate KEX2 

protease processing sites in a protein. KEX2 protease processing sites are inactivated 
by deleting, adding or substituting residues to alter Arg-Arg, Arg-Lys, and Lys-Arg 
pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys pairings are 
considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or Lys- 

20 Arg to Lys-Lys represents a conservative and preferred approach to inactivating KEX2 
sites. 

Hydrophilic amino acids may be substituted for hydrophobic amino acids in the 
transmembrane region and/or intracellular domain of gpl30 and LIF-R to enhance 
water solubility of the proteins. Addition of amino acids to the native sequence may 
25 result from translation of in-frame codons present in linkers used in constructing 
cloning or expression vectors. The LIF-R encoded by clone pHLIF-R-65 contains 
such linker-encoded amino acids at the C-terminus, as described by Gearing et al, 
supra. 

The present invention also includes proteins with or without associated native- 
30 pattern glycosylation. Expression of DNAs encoding the fusion proteins in bacteria 
such as E. coli provides non-glycosylated molecules. Functional mutant analogs 
having inactivated N-glycosylation sites can be produced by oligonucleotide synthesis 
and ligation or by site-specific mutagenesis techniques. These analog proteins can be 
produced in a homogeneous, reduced-carbohydrate form in good yield using yeast 
35 expression systems. N-glycosylation sites in eukaryotic proteins are characterized by 
the amino acid triplet Asn-Aj-Z, where Al is any amino acid except Pro, and Z is Se: 
or Thr. In this sequence, asparagine provides a side chain amino group for covalen: 
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attachment of carbohydrate. Such a site can be eliminated by substituting another 
amino acid for Asn or for residue Z, deleting Asn or Z, or inserting a non-Z amino acid 
between A\ and Z, or an amino acid other than Asn between Asn and Ai. Known 
procedures for inactivating N-glycosylation sites in proteins include those described in 

5 U.S. Patent 5,071,972 and EP 276,846. 

Variants of the receptor proteins of the present invention also include various 
structural forms of the primary protein which retain biological activity. Due to the 
presence of ionizable amino and carboxyl groups, for example, a receptor protein may 
be in the form of acidic or basic salts, or may be in neutral form. Individual amino acid 

10 residues may also be modified by oxidation or reduction. 

The primary amino acid structure also may be modified by forming covalent or 
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, 
phosphate, acetyl groups and the like. Covalent derivatives are prepared by linking 
particular functional groups to amino acid side chains or at the N- or C- termini. Other 

15 derivatives of the receptor protein within the scope of this invention include covalent or 
aggregative conjugates of the receptor protein with other proteins or polypeptides, such 
as by synthesis in recombinant culture as N- or C- terminal fusions. For example, the 
conjugated polypeptide may be a signal (or leader) polypeptide sequence at the N- 
terminal region of the protein which co-translationally or post-translationally directs 

20 transfer of the protein from its site of synthesis to its site of function inside or outside 
of the cell membrane or wall (e.g., the yeast a-factor leader). Peptides may also be 
added to facilitate purification or identification of the fusion protein (e.g., poly-His). 
The amino acid sequence of the fusion protein can also be linked to the peptide Asp- 
Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK) (Hopp et al., Bio/Technology 6:1204, 

25 1988) The latter sequence is highly antigenic and provides an epitope reversibly bound 
by a specific monoclonal antibody, enabling rapid assay and facile purification of 
expressed recombinant protein. This sequence is also specifically cleaved by bovine 
mucosal enterokinase at the residue immediately following the Asp-Lys pairing. 
Receptor proteins capped with this peptide may also be resistant to intracellular 

30 degradation in E. coli. 

The receptors of the present invention are primarily useful as oncostatin M 
binding reagents, and may be administered in vivo to inhibit a biological activity of 
oncostatin M (including signal transduction). The inventive receptors also have use as 
LIF binding reagents. 

35 Disorders mediated by either oncostatin M or LIF may be treated by 

administering a therapeutically effective amount of the receptor of the present invention 
to a human or mammalian patient afflicted with such a disorder. A disorder is said to 
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be mediated by oncostatin M or LIF when biologically active oncostatin M or LIF 
causes (directly or indirectly) or exacerbates the disorder. Soluble receptor proteins can 
be used to competitively bind to LIF and oncostatin M, thereby inhibiting binding of 
LIF and oncostatin M to cell surface receptors. 
5 As discussed in example 2, gpl30 has now been found to bind oncostatin M, 

although with lower affinity than the inventive receptors comprising both gp!30 and 
LIF-R. gpl30 may be administered to treat conditions mediated by oncostatin M, 
although a gpl30/LIF-R receptor of the present invention would be preferred for such a 
purpose. 

10 Oncostatin M has been reported to stimulate hematopoiesis, stimulate epithelial 

cell proliferation, increase plasmin activity (thereby inducing fibrinolysis), inhibit 
angiogenesis and supress expression of major histocompatibility complex antigens on 
endothelial cells. See PCT application WO 9109057 and European patent application 
no, 422,186. When these or other biological effects of oncostatin M are undesirable, a 

15 receptor of the present invention may be administered to bind oncostatin M. 

Oncostatin M is believed to stimulate production of the cytokine interleukin-6 
(EL-6), as reported by Brown et al., J, Immunol. 147:2115 (1991). Oncostatin M 
therefore indirectly mediates disorders associated with the presence of EL-6. EL-6 has 
been reported to be involved in the pathogenesis of A IDS -associated Kaposi's sarcoma 

20 (deWit et al., J. Intern, Med. [England] 229:539 [1991]). Binding of oncostatin M by 
a receptor of the present invention thus may be useful in treating Kaposi's sarcoma. 
Alternatively, but less preferably, gpl30 may be administered to treat Kaposi's 
sarcoma. 

Among the disorders mediated by LIF are lipoprotein metabolism defects such 
25 as atherosclerosis and obesity, as well as disorders of bone and calcium metabolism or 
disorders associated with LIF overproduction that affect hepatocytes, neurons, or 
leukocytes. The regulation of embryonic and hematopoietic stem cells by LEF may also 
be manipulated with the receptor. A soluble form of the receptor may also be used to 
treat leukemic cells which respond to LEF by proliferating. LEF also may play a role in 
30 inducing cachexia in cancer or AIDS patients. The receptor, or antibodies thereto, may 
also be useful as a diagnostic reagent to detect diseases characterized by the presence of 
abnormal LIF-R. 

Oncostatin M and LIF are different proteins, but share certain structural and 
biological properties. If inhibition of a biological activity shared by oncostatin M and 
35 LIF is desired, the receptor of the present invention offers the benefit of binding both of 
these proteins exhibiting the particular biological activity. A receptor binding only one 



WO 93/10151 




PCT/US92/10272 



24 



of the proteins would leave the other protein active and continuing to mediate the 
disorder. 

Receptor proteins or derivatives thereof may also be used as reagents in 
receptor-based immunoassays, reagents in assays for oncostatin M or LIF, or as 
5 binding agents for affinity purification of oncostatin M or LIF. The receptor proteins of 
the present invention may be used as immunogens in conventional procedures for 
production of polyclonal or monoclonal antibodies. Such antibodies may be employed 
on immunoaffinity columns for purification of the receptor, or as components of 
diagnostic or research assays. Derivatives may also be obtained by attacking additional 

10 polypeptide(s), e.g., by using a cross-linking agent, such as N-maleimidobenzoyl 
succinimide ester that reacts with cysteine and lysine residues. Receptor proteins may 
also be covalently bound through reactive side groups to various insoluble substrates, 
such as cyanogen bromide-activated, bisoxirane-activated, carbonyldiimidazole- 
activated or tosyl-activated agarose structures, or by adsorbing to polyolefin surfaces 

15 (with or without glutaraldehyde cross-linking). 

The following examples are provided to illustrate certain embodiments of the 
invention, and are not to be construed as limiting the scope of the invention. 



25 essentially as described by Hopp, et al., Bio/Technology 6:1204 (1988). The purified 
protein is radiolabeled using a commercially available enzymobead radioiodination 
reagent (BioRad). In this procedure 10 |ig LIF in 50 (il 0.2 M sodium phosphate, pH 
7.2, are combined with 50|il enzymobead reagent, 2 mCi of sodium iodide in 20 |il of 
0.05 M sodium phosphate pH 7.0 and 10 |il of 2.5% B-D-glucose. After 10 minutes at 

30 25 # C, sodium azide (20 jil of 50 mM) and sodium metabisulfite (10 |xl of 5 mg/ml) are 
added and incubation is continued for 5 min. at 25*C. The reaction mixture is 
fractionated by gel filtration on a 2 ml bed volume of Sephadex® G-25 (Sigma) 
equilibrated in Roswell Park Memorial Institute (RPMI) 1640 medium containing 2.5% 
(w/v) bovine serum albumin (BSA), 0.2% (w/v) sodium azide and 20 mM Hepes pH 

35 7.4 (binding medium). The final pool of H^l-LIF is diluted to a working stock 
solution of 3 x 10"^ M in binding medium and stored for up to one month at 4*C 



EXAMPLES 



20 



Example 1 

Assay to Detect Binding of LIF 



Recombinant human LIF is expressed in yeast and purified to homogeneity 
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without detectable loss of receptor binding activity. The specific activity is routinely in 
the range of 6-8 x 10 15 cpm/mmole LIF. 

The radiolabeled LIF may be employed in any of a number of conventional 
assay procedures to determine whether a given protein or cell binds LIF. Examples of 
5 such assays are those that detect binding of the radiolabeled LIF to cells expressing an 
LIF-binding protein on the cell surface. The radiolabeled LIF also may be employed in 
assays for the presence of LIF-binding proteins in cell culture medium (e.g. LEF- 
binding proteins secreted by recombinant cells). Proteins in cell extracts (e.g. from 
recombinant cells) also may be assayed for the ability to bind the radiolabeled LIF. 

10 In one assay procedure, cells transformed/transfected with an expression 

system encoding a protein to be tested for ability to bind LEF are plated at a density of 2 
x 10^ cells/well in either 6 well dishes (Falcon) or single well chambered slides (Lab- 
Tek). Both dishes and slides are pretreated with 1 ml human fibronectin (10 ug/ml in 
PBS) for 30 minutes followed by 1 wash with PBS. After 48 to 72 hours, cells are 

15 assayed for expression of LEF binding proteins by binding radioiodinated LEF using the^ 
following slide autoradiography technique, Transfected cells are washed once with 
binding medium (RPMI media 1640 containing 25 mg/ml bovine serum albumin 
(BSA), 2 mg/ml sodium azide, 20 mM HEPES, pH 7.2, and 50 mg/ml nonfat dry milk 
(NFDM) and incubated for 2 hours at 4 # C with 1 ml binding medium + NFDM 

20 containing 1.25 x 10* 9 M 125 I -LIF. After incubation, cells in the chambered slides are 
washed three times with binding buffer + NFDM, followed by 2 washes with PBS, pH 
7.3, to remove unbound 

125i 

-LEF. The cells are fixed by incubating for 30 minutes at 
room temperature in 10% gluteraldehyde in PBS, pH 7.3, washed twice in PBS, and 
air dried. The slides are dipped in Kodak NTB-2 photographic emulsion (5x dilution in 

25 water) and exposed in the dark for 12 hours to 7 days at 4*C in a light proof box. The 
slides are then developed for approximately 5 minutes in Kodak D19 developer (40 
g/500 ml water), rinsed in water and fixed in Agfa G433C fixer. The slides are 
individually examined with a microscope at 25-40x magnification and positive cells that 
bind LIF are identified by the presence of autoradiographic silver grains against a light 

30 background. 

Cells in the 6 well plates are washed once with binding buffer * NFDM 
followed by 3 washings with PBS, pH 7.3, to remove unbound 1 25j_ljp ce u s 
are then trypsinized to remove them from the plate and bound ^5j_ljp j s counted on a 
gamma counter. 

35 The cells in transfectant pool(s) testing positive are subdivided into smaller 

pools and the screening process is repeated (with further subdividing of the pools as 
necessary; until an individual clone expressing LIF-binding protein is isolated. Non- 
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specific binding of ^I-LIF ma y ^ e measured in the presence of 200-fold or greater 
excess of unlabeled LIF. As a control, the same host cells transfected with a vector 
lacking LIF-R-encoding sequences should be assayed to determine whether 
background endogenous LIF receptors are present on the host cells. 
5 In another assay procedure, cells producing a soluble LIF-binding protein that 

is released from the cells into the culture medium may be identified. Cells are collected 
by centrifugation from a culture broth. The supernatant (culture medium) is 
concentrated 10-fold, and 1 |il aliquots are spotted onto nitrocellulose filters and 
allowed to air dry. Additional binding sites are blocked by overnight incubation at 4*C 
10 in the above-described binding medium containing 3% non-fat dry milk (BMNFDM). 
Filters are incubated for 2h at 4*C in BMNFDM containing 1 nM ^^l-LTF in the 
presence or absence of 200 nM unlabeled LIF, then washed (3x5 min) in PBS. 
Filters are exposed to photographic film for 48 hr at room temperature. 

The results of one LIF binding assay conducted according to the following 
15 procedure are shown in Figure L Host cells transfected with vector(s) encoding LIF-R 
or gpl30 as described below were assayed for the ability to bind LIF. The host cells 
were the monkey kidney cell line designated COS-7, described by Glutzman, Cell 
25:175 (1981). In separate transfections, COS-7 cells were transfected with the 
following combinations of vectors. The different types of transfected cells (and non- 
20 transfected control cells) are designated A-F as shown below, and the curves 
representing the LIF-binding assay data for each transfected or control cell type are also 
labeled A-F in Figure 1 . 

(A) B10G (the gpl30 encoding vector described in example 3) and 
25 pHLIFR-65 (the LIF-R encoding vector described in example 3) 

(B) pHLIFR-65 and control vector CAV (a control vector that does not 
encode LIF-R or gpl30; controls for plasmid dilution so that results can be more 
accurately compared with those of COS-7 cells co-transfected with both a gpl30 

30 encoding vector and an LEF-R encoding vector) 

(C) B10G and pHLIFR-65; transfected cells were preincubated with 
non-radiolabeled oncostatin M before incubation with 125 I-LEF 



35 (D) pHLIFR-65 and CAV; transfected cells were preincubated with non- 

radiolabeled oncostatin M before incubation with 125 I-LEF 
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(E) non-transfected COS-7 cells (control) 

(F) B10G andCAV 

5 The assay was performed by a phthalate oil separation method essentially as 

described by Dower et al., J. Immunol. 132:151 (1984) and Park et al., J. Biol. Chem. 
261:4117 (1986). Briefly, the COS-7 host cells were released from 10 cm tissue 
culture plates two days after transfection by incubation in non-enzymatic cell 
dissociation buffer (Sigma) at 37*C for 30-60 minutes. Cells were then washed with 

10 the above-described binding medium and resuspended in binding medium at SxlO 6 
cells/ml. 50(il aliquots of the cells were incubated with serial dilutions of 125 I-LEF at 
room temperature for one hour with agitation (in the presence or absence of a 200-fold 
excess of unlabeled LIF) in a total volume of 150(iL The unlabeled LIF allowed for 
calculation of the non-specific background binding of LEF. Duplicate aliquots (60|il) of 

15 each incubation mixture were then transferred to a polyethylene centrifuge tube 
containing a phthalate oil mixture comprising 1.5 pans dibutylphthalate to 1 pan bis(s- 
ethylhexyl)phthalate. 

The cells were separated from unbound 125 I-LIF by centrifugation for five 
minutes at 15,000 x g in an Eppendcrf microfuge. The centrifuge tubes were cut to 

20 separate the pellet of cells (containing bound 125 I-LIF) from the supernatant containing 
unbound 125 I-LIF. The radioactivity in both pans was then determined on a gamma 
counter. The determinations of both cell-bound and unbound radioactivity from the 
two 60jil aliquots were averaged for subsequent calculations. 

The results are presented in Figure 1 as standard Scatchard transformations of 

25 the biological data. The data are reponed as the ratio of molecules of 125 I-LIF bound 
per cell, to free 125 I-LIF molecules (y-axis) versus molecules of 125 I-LIF bound per 
cell (x-axis). The dissociation constants (Kd) are shown in Figure 1, along with the 
number of LIF-binding sites per cell. Since a saturating amount of radiolabeled LIF 
was offered, the number of molecules of radiolabeled LIF bound per cell is considered 

30 equivalent to the number of LIF binding sites per cell. 

As shown by curve A of Figure 1, COS-7 cells co-transfected with a gpl30 
encoding vector (B10G) and an LIF-R encoding vector (pHLIFR-65) demonstrated 
high affinity LIF binding (Kd=9x10' 10 M) . When these same co-transfected COS-7 
cells were preincubated with non-radiolabeled oncostatin M before incubation with 

35 125 I-LIF (curve C). binding of LIF was greatly reduced (Kd=4.2x10"^M). Oncostatin 
M thus competes with LEF for binding sites on these transfected cells. 
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COS-7 cells transfected with a vector encoding the single-polypeptide chain 

LIF-R (pHLIFR-65) and with the control vector CAV bound LIF (curve B; 

Kd=2.4x10 _9 M), but with lower affinity than the cells producing both gp!30 and LIF- 

R. The COS-7 cells display endogenous high affinity simian LIF receptors (curve E: 
5 Kd about 3xl0~ n M). Transfection with pHLEFR-65 (encoding the single polypeptide 

LIF-R) results in display of additional low affinity LIF receptors (Kd=2.4x10~ 9 M; 

curve B, site 2) as well as the simian LIF receptors Kd=3.3x10" 11 M; curve B, site 1). 
When the COS-7 cells transfected with pHLIFR-65 and CAV were 

preincubated with non-radiolabeled oncostatin M before incubation with ^I-LIF 
10 (CURVE D), binding of LIF to the LIF-R expressed by pHLIFR-65 was essentially 

unchanged compared to the same transfected cells not preincubated with oncostatin M. 

Oncostatin M thus does not compete with LIF for binding to the single polypeptide 

chain LIF-R. However, the binding of LIF to the endogenous simian high affinity 

LIF-R on the COS-7 cells was competed. 
15 The COS-7 cells co- transfected with the gpl30 encoding vector and the CAV 

control vector (curve F) did not bind LIF in any measurable amount above the amount 

of binding to the non-transfected COS-7 cells (curve E). 

Example 2 

20 Assay to Detect Binding of Oncostatin M 

Oncostatin M may be purified from cells in which the protein is naturally found, 
or from cells transformed with an expression vector encoding oncostatin M. One 
source of oncostatin M is phorbol ester- treated U937 cells, as described by Zarling et 

25 al., PNAS U.SA. 55:9739 (1986). Purification of recombinant oncostatin M is 
described by Linsley et al., J. BioL Chern. 264:4282-4289 (1989), which is hereby 
incorporated by reference in its entirety. 

Preferably, oncostatin M is produced in yeast cells transformed with a suitable 
expression vector. A DNA sequence encoding a signal sequence (e.g., a yeast alpha- 

30 factor leader sequence) may be fused to the N-terminus of the oncostatin M encoding 
DNA sequence to promote secretion of the protein from the host cells. The protein 
when initially produced may also comprise an N-terminal identification leader (e.g., a 
"flag" sequence such as Asp-Tyr-Lys-Asp4-Lys) as described by Hopp et al., 
Bio/Technology 6:1204 (1988). The flag sequence is highly antigenic and provides an 

35 epitope reversibly bound by a specific monoclonal antibody, enabling facile purification 
of the expressed recombinant protein. This sequence is also specifically cleaved by- 
bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. 
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Neither the signal sequence nor the flag sequence is found on the processed final 
oncostatin M product. 

Oncostatin M may be radiolabeled using any suitable conventional procedure, 
such as the radioiodination procedure employed to radiolabel LIF in Example 1. The 
5 radio-iodination of oncostatin M has also been described by Linsley et al., supra. 

The resulting radiolabeled oncostatin M may be substituted for radiolabeled LIF 
(using the same concentrations and other reaction parameters) in the assay procedures 
described in Example 1 in order to detect proteins and cells that bind oncostatin M. An 
assay for binding of ^5i_ oncostat j n y[ t0 ce n s j s a j so described in Linsley et al., 
10 supra. 

The results of one oncostatin M binding assay are shown in Figure 2. COS-7 
cells transfected with vector(s) encoding gpl30 or LEF-R were assayed for the ability to 
bind oncostatin M. In separate transfecrions, COS-7 cells were transfected with the 
following combinations of vectors. The different types of transfected cells (and non- 
15 transfected control cells) are designated A-E as shown below, and the corresponding 
curves representing the oncostatin M binding assay data for each cell type are also 
labeled A-E in Figure 2. 

(A) B10G (the gpl30 encoding vector described in example 3) and 
20 pHLIFR-65 (the LIF-R encoding vector described in example 3) 

(B) B10G and pHLIFR-65; transfected cells were preincubated with 
non-radiolabeled LIF before incubation with 125 I-oncostatin M 

25 (C) pHLIFR-65 and CAV (a control vector that does not encode LIF-R 

or gpl30, controls for plasmid dilution so that results can be more accurately compared 
with those of COS-7 cells co-transfected with both a gpl30 encoding vector and an 
LEF-R encoding vector) 

30 (D) non-transfected COS-7 cells (control) 

(E) B10G and CAV 

The assay was performed by the phthalate oil separation method described in 
35 example 1 (but substituting oncostatin M for LIF). The results are presented in Figure 
2 as standard Scatehard transformations of the biological data. The data are reported as 
the ratio of molecules of I-oneostatin M bound per cell, to free ^I-oneosrat::*. M 
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molecules (y-axis) versus molecules of 125 I-oncostatin M bound per cell (x-axis). The 
dissociation constants (Kd) are shown in Figure 2, along with the number of oncostatin 
M-binding sites per cell. Since a saturating amount of radiolabeled oncostatin M was 
offered, the number of molecules of radiolabeled oncostatin M bound per cell is 
5 considered equivalent to the number of oncostatin M binding sites per cell. 

As shown by curve A in Figure 2, COS-7 cells co-transfected with a gpl30 
encoding vector (B10G) and an LIF-R encoding vector (pHLEFR-65) demonstrated the 
ability to bind oncostatin M with high affinity (Kd-2.4x10* 10 M). 

COS-7 cells co-transfected with a vector encoding the single-polypeptide chain 
10 LIF-R (pHLEFR-65) and with the control vector CAV (curve C) did not bind oncostatin 
M in any significant amount above that bound by the non-transfected COS-7 cells 
(curve D). 

COS-7 cells co-transfected with pHLIFR-65 and B10G and preincubated with 
non-radiolabeled LEF before incubation with l 25 I-oncostatin M (curve B) did not bind 
15 oncostatin M in any measurable amount above that bound by the non-transfected COS- 
7 cells. LEF thus competes with oncostatin M for binding sites on the recombinant 
cells. 

The experimental conditions of this assay (the results of which are shown in 
Figure 2) were not appropriate for accurate detection of low affinity oncostatin M 

20 receptors. Thus, a separate experiment (phthalate oil separation method) was 

conducted to compare oncostatin M binding by COS-7 cells transfected with B10G 
alone (no CAV control vector) with oncostatin M binding to non-transfected COS-7 
cells. Non-transfected COS-7 cells assayed as a control demonstrated a small number 
of high affinity oncostatin M receptors (Kd=3.6x1O~ 10 M). The cells transfected with 

25 B10G demonstrated additional low affinity binding of oncostatin M (Kt>=7.7xl0* 9 M). 
The results of this oncostatin M binding assay are shown in Figure 3 as Scatchard 
transformations of the biological data. The data are reported as the ratio of molecules of 
125 I-oncostatin M bound per cell, to free 125 I-oncostatin M molecules (y-axis) versus 
molecules of l 25 I-oncostatin M bound per cell (x-axis). The scale in Figure 3 differs 

30 from that of Figures 1 and 2 so that the difference in oncostatin M binding by the 
gpl30-producing cells compared to the control cells can be more readily visualized. 

Disorders mediated by oncostatin M thus may be treated by administering 
gp 130 or a fragment thereof Receptors comprising both gpl30 and LIF-R are 
preferred for use in treating such conditions, however, in view of the higher affinity of 

35 such receptors for oncostatin M compared to the affinity of gpl 30 alone for oncostatin 
M. gpl 30 also may be employed as an oncostatin M-binding reagent in diagnostic and 
research assays. 
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Example 3 

Preparation of a Recombinant Fusion Protein Designated LDF-R-Linker-gpl30 

5 A recombinant receptor protein of the present invention is prepared by the 

following procedure. The receptor comprises an LIF-R fragment at the N-terminus 
attached to a gp 130 fragment through a polypeptide linker. The polypeptide linker is of 
the formula (Gly4Ser)8. An oligonucleotide encoding a portion of the linker sequence, 
i.e., the sequence Ser(Gly4Ser)6Gly is synthesized by any of the conventional known 
10 procedures for oligonucleotide synthesis. The DNA and encoded amino acids 
sequences of the double-stranded oligonucleotide are as follows; 

SEQ ID NO: 7 

15 Bam HI 

5' GATCC GGT Go A GGT GGT TCT GGT GGA GGT GGT TCA GGT GGT GGA GGA TCA 
3 ' G CCT CCT CCA CCA AGA CCA CCT CCA CCA AGT CCA CCA CCT CCT AGT 
Ser Glv Glv Glv Gly Ser Gly Giy Gly Gly Ser Gly Glv Gly Giy Ser 

20 

BspMII Xbal 

GGA GGT GGT GGA TCA GGT GGA GGA GGT TCT GGA GGT GGA GGT TCC GGA T 3' 
25 CCT CCA CCA CCT AGT CCA CCT CCT CCA AGA CCT CCA CCT CCA AGG CCT AGATC 
5' 

Gly Gly Gly Gly Ser Gly Gly Giy Gly Ser Gly Gly Giy Giy Ser Glv 

The remaining portion of the linker is added during vector construction as described 
30 below. This oligonucleotide as well as those discussed below may be synthesized on 
an automated DNA synthesis machine such as those available from Biosearch, Inc., 
San Rafael, California or Applied Biosystems. 

The linker encoding oligonucleotide is cloned into a vector that preferably 
contains multiple restriction endonuclease cleavage sites that may be employed for 
35 inserting the sequences encoding LIF-R and gpl30 on either side of, and in the same 
reading frame as, the sequence encoding the linker. One such vector is designated 
pBLUESCRIPT SK® which is available from Stratagene Cloning Systems, La Jolla, 
California. This plasmid vector is replicable in E. coli and contains a polylinker 
segment that includes 21 unique restriction sites. The plasmid is digested with the 
40 restriction enzymes BamHI and Xbal and the linker-encoding oligonucleotide is ligated 
into the vector using conventional techniques. A recombinant vector containing the 
inserted oligonucleotide seauenee is identified bv restriction endonuclease anaivsis and 
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sizing by gel electrophoresis. A DNA sequence encoding LIF-R is inserted into the 
pBLUESCRIPT SK® vector upstream of the linker- encoding oligonucleotide and a 
DNA sequence encoding gpl 30 is inserted downstream of the linker sequence. cDNA 
molecules encoding soluble fragments of LIF-R and gpl 30 were isolated and amplified 
5 using the well known polymerase chain reaction (PCR) procedure. The following 
oligonucleotides were synthesized for use in the PCR procedures: 

SEQ ID NO: 8 (Oligonucleotide No. 1) 

10 Sail 

5 ' GATATGTCGACGATGATGGATATTTACGTATGTTTG 3 ' 

1 5 SEQ ID NO: 9 (Oligonucleotide No. 2) 

3 ' CATACATACACCACTGTTTCCTTTTAAGACCTCCTCCA CCTAGG TACG 5 ' 

BamKI 

20 

SEQ ID NO: 10 (Oligonucleotide No. 3) 
BspMII 

25 5 ' CGCGTCCGGAGGAGGTGGATCTGAACTTCTAGATCCATGTGGTTATATC 3 ' 



SEQ ID No. 1 1 (Oligonucleotide No. 4) 

30 

3 ' CAAACGAGTTCCTCTTTAACTTATCCGCCGGCGTACG 5 * 

Oligonucleotides 1 and 2 are used in a PCR reaction to isolate a soluble 
fragment of LIF-R. The template employed in the reaction is the human LIF-R cDNA 

35 cloned as described by Gearing et al. supra. The DNA and encoded amino acid 
sequences of the cDNA clone are represented in SEQ ID NO: 5. The cloning vector 
which contains this human LIF-R cDNA clone was deposited in E. coli host cells with 
the American Type Culture Collection, Rockville, Maryland, U.S.A. on December 11, 
1990, under the name pHLIFR-65 (ATCC Accession Number 68491). The deposit 

40 was made under the conditions of the Budapest Treaty. The 5' primer is 
oligonucleotide No. 1, which includes a DNA sequence encoding the first 8 amino 
acids of the signal sequence of LIF-R and also comprises upstream sequences that 
introduce a Sal 1 restriction endonuclease cleavage site. Oligonucleotide No. 1 is 
capable of annealing to the (-) strand that is complementary to nucleotides 179-202 of 

45 SEQ ID NO: 5. The 3' pnmer is oligonucleotide No. 2. which contains a sequence 
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complementary to nucleotides 2651-2677 of SEQ ID NO: 5 (i.e., includes anti-sense 
nucleotides encoding the last nine amino acids of the extracellular domain of LIF-R). 
Immediately downstream of the LIF-R encoding sequence, oligonucleotide No. 2 
contains a sequence encoding (Gly)4 Ser, and also introduces a BamHI restriction 
5 endonuclease cleavage site. 

A PCR reaction employing oligonucleotides Nos. 1 and 2 thus isolates and 
amplifies a DNA sequence encoding an LEF-R fragment containing the entire signal 
sequence and the entire extracellular domain but lacking the transmembrane region and 
the extracellular domain. The (Gly)4 Ser sequence attached to the 3' terminus of the 

10 LIF-R fragment is pan of the polypeptide linker in the final construct. 

Any suitable PCR procedure may be employed. One such procedure is 
described in Sarki et al., Science 259:487 (1988). Another is described in 
Recombinant DNA Methodology, Wu et al., eds.. Academic Press Inc., San Diego 
(1989), pp. 189-196. In general, PCR reactions involve combining the 5' and 3' 

15 nucleotides with the template DNA and each of the four deoxynucleoside triphosphates 
in a suitable buffered solution. The solution is heated, (e.g, from 95" to 100'C) to 
denature the double-stranded DNA template and is then cooled before addition of a 
DNA polymerase enzyme. Multiple cycles of the reactions are carried out in order to 
amplify the desired DNA fragment. 

20 An example of a suitable PCR procedure is as follows. All temperatures are in 

degrees centigrade. The following PCR reagents are added to a 1.5 ml Eppendorf 
microfuge tube: 10 \i\ of 10X PCR buffer (500 mM KC1, 100 mM Tris-HCl, pH 8.3 
at 25'C, 25 mM MgCb, and 1 mg/ml gelatin) (Perkin-Elmer Cetus, Norwalk, CN), 10 
|il of a 2 mM solution containing each dNTP (2 mM dATP, 2 mM dCTP, 2 mM dGTP 

25 and 2 mM dTTP), 2.5 units (0.5 |il of standard 5000 units/ml solution) of Taq DNA 
polymerase (Perkin-Elmer Cetus), 50 ng of template DNA, 5 jjJ of a 20 (lM solution of 
each of oligonucleotide primers 1 and 2, and 74.5 |il water to a final volume of 100 |il. 
The final mixture is then overlaid with 100 \i\ parafin oil. PCR is carried out using a 
DNA thermal cycler (Ericomp, San Diego, CA) by initially denaturing the template at 

30 94* for 90 seconds, reannealing at 55* for 75 seconds and extending the cDNA at 72* 
for 150 seconds. PCR is carried out for an additional 20 cycles of amplification using a 
step program (denaturation at 94", 25 sec; annealing at 55\ 45 sec; extension at 72\ 
150 sec), followed by a 5 minute extension at 72°. 

The sample is removed from the parafin oil and DNA extracted by 

35 phenolchloroform extraction and spun column chromatography over G-50 (Boehringer 
Mannheim). A 10 ul aliquot of the extracted DNA is separated by electrophoresis on 
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1% SeaKem™ agarose (FMC BioProducts, Rockland, ME) and stained with ethidium 
bromide to confirm that the DNA fragment size is consistent with the predicted product. 

The PCR-amplified cDNA products are then digested with Sail and BamHI 
restriction enzymes using standard procedures. The Sall/BamHI restriction fragment is 
5 then separated by gel electrophoresis, e.g., on a 1.2% Seaplaque™ low gelling 
temperature (LGT) agarose, and the band representing the desired fragment is isolated. 
The fragment is inserted into a vector encoding the desired fusion protein as described 
below. 

A plasmid vector containing human gpl30 cDNA was deposited in E. coli 

10 strain DH5a host cells with the American Type Culture Collection, Rockville, 
Maryland under the name B10G/pDC303 (DH5a) on November 14, 1991 and 
assigned ATCC Accession No. 68827. The deposit was made under the conditions of 
the Budapest Treaty, The DNA and encoded amino acid sequences of this cloned 
cDNA are shown in SEQ ID NO: 1. 

15 Oligonucleotides 3 and 4 are employed in the polymerase chain reaction 

procedure to amplify and isolate a DNA fragment encoding Ser(Gly)4 Ser followed by 
amino acids 1 to 597 of SEQ ED NO: 1 (the entire extracellular domain of the mature 
gpl30 protein). The 5' primer, oligonucleotide No. 3, includes nucleotides 310 to 336 
of SEQ ED NO: 1, which encode the first nine amino acids of the mature gpl30 

20 protein. This nucleotide sequence is capable of annealing to the (-) strand that is 
complementary to nucleotides 310 to 336 of SEQ ED NO: 1. Oligonucleotide No. 3 
also encodes a Ser(Gly)4 Ser sequence directly upstream of (and in the same reading 
frame as) the gpl30 sequence, and further positions a BspMII restriction endonuclease 
cleavage site near the 5' terminus of the Ser(Gly)4 Ser -encoding sequence. 

25 The 3' primer, oligonucleotide No. 4, includes a sequence complimentary to 

nucleotides 2080 to 2100 of SEQ ID NO: 1, i.e., includes anti-sense nucleotides 
encoding the last seven amino acids of the gpl30 extracellular domain. Oligonucleotide 
No. 4 positions a stop codon immediately after the gpl30 sequence and also inserts a 
NotI restriction site downstream. Following amplification of the gpl30 fragment by 

30 PCR, the PCR reaction products are digested with BspMII and NotI and the desired 
fragment is isolated. 

The above-described LIF-R, Ser(Gly4Ser)6Gly linker, and gpl30 encoding 
fragments are assembled into a single DNA sequence as follows. The 
Ser(Gly4Ser)6Gly linker fragment is excised from the pBLUESCRIPT SK® vector by 

35 digestion with BamHI and BspMII. The linker fragment is then ligated to the 3' end of 
the LIF-R fragment (cleaved at its 3' terminus after the Gly4Ser sequence with 
BamHI). The ligation is conducted under conventional conditions. The 3' end of the 
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linker fragment is ligated to the BspMII-cleaved 5' end of the gpl30 fragment. The 
resulting DNA fragment encodes a receptor of the present invention comprising (from 
5' to 3') the signal sequence and extracellular domain of LIF-R attached to a (Gly4Ser)g 
polypeptide linker which is attached to the mature coding sequence of the gpl30 
5 extracellular domain. 

This DNA fragment may be inserted into any suitable cloning and/or expression 
vector. For example, the pBLUESCREPT SK® vector may be digested with Sail and 
Noil and the ligated DNA fragment inserted therein. E. coli cells are then transformed 
with the recombinant vector by conventional procedures. 

10 In an alternative procedure, the pBLUESCRIPT SK® vector containing the 

Ser(Gly4Ser)6Gly linker sequence is digested with Sail and BamHI and the above 
described LIF-R-encoding fragment is inserted therein. The resulting vector is then 
digested with BspMII and NotI and the gpl30-encoding fragment is then inserted to 
form the DNA sequence encoding the receptor of the present invention. The cloned 

15 receptor-encoding DNA fragment may be excised and inserted into any suitable 
expression vector (chosen according to the type of host cell that is desired) using 
conventional procedures. Host cells transformed with the recombinant expression 
vector are cultivated to produce the receptor protein. Mammalian host cells are 
generally preferred for producing the recombinant receptor fusion proteins of the 

20 present invention. 

The receptor-encoding construct may be excised by Sail and NotI digestion and 
inserted into a vector suitable for use in mammalian host cells. One suitable vector is 
designated pDC406. cDNA molecules inserted at the Sail site of this vector are 
transcribed and translated using regulatory elements derived from HIV and adenovirus. 

25 pDC406 contains origins of replication derived from SV40, Epstein-Barr virus and 

pBR322. The pDC406 vector into which interleukin-1 receptor cDNA has been cloned 
has been deposited with the American Type Culture Collection, Rockville, Maryland 
USA under accession number CRL10478. The interleukin-1 receptor cDNA may be 
excised from the vector using conventional techniques and replaced with the receptor 

30 encoding DNA of the present invention prepared above. pDC406 is a derivative of 
HAV-EO described by Dower et al., J. Immunol 142:4314 (1989). pDC406 differs 
from HAV-EO by the deletion of the intron present in the adenovirus 2 tripartite leader 
sequence in HAV-EO. 

Examples of suitable mammalian cells for expressing a receptor fusion protein 

35 include CV-1 cells (ATCC CCL70) and COS-7 cells, (ATCC CRL 1651 ) both derived 
from monkey kidney. Another monkey kidney cell line CV-1/EBNA f ATCC CRL 
10478) was derived by transfeenon of the CV-1 cell line with a gene encoding Epstein- 
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Barr virus nuclear antigen- 1 (EBNA- 1 ) and with a vector containing CMV regulatory 
sequences. See McMahan et al., EMBO J. 70:2821 (1991). The EBNA- 1 gene allows 
for episomal replication of expression vectors, such as HA V-EO or pDC406, that 
contain the EBV origin of replication. 

5 

Example 4 

Preparation of a Recombinant Receptor Fusion Protein Designating 

gp!30-Linker-LIF-R 

10 This receptor of the present invention differs from that of Example 3 in that the 

LEF-R polypeptide (which was the 5' polypeptide in the receptor of Example 3) is now 
the 3' polypeptide. The following oligonucleotides were synthesized for use in 
preparing the fusion protein: 

15 SEQIDNO:12 

5' GATATGTCGACAAGATGTTGACGTTGCAGACTTGG 3' (oligonucleotide no. 5) 

SEQIDNO: 13 

3 1 CAAACGAGTTCCTCTTTAACTTCCTCCTCCACCTAGGTACG 5 ' (oligonucleotide no. 6) 

20 

SEQIDNO: 14 

5 1 CGCGTCCGGAGGAGGTGGTAGCCAGAAAAAGGGGGCTCCTCATG 3 1 (oligonucleotide 

no. 7) 
25 SEQIDNO: 15 

3 1 CATACATACACCACTGTTTCCTTTTAAGAATCGCCGGCGTACG 5 ' (oligonucleotide no. 

8) 



Oligonucleotides 5 and 6 are employed in a polymerase chain reaction procedure 
30 to isolate a fragment of gpl30. The 5' primer (oligonucleotide number 5) includes 
nucleotides 244 to 264 of SEQ ID NO: 1, (the sequence encoding the first seven amino 
acids of the gpl30 signal sequence). Oligonucleotide number 5 also includes a 
sequence that introduces an upstream Sail site. This nucleotide sequence is capable of 
annealing to the (-) strand that is complementary to nucleotides 244 to 264 of SEQ ED 
35 NO: 1. The 3* primer (oligonucleotide number 6) includes a sequence complementary 
to nucleotides 2080 to 2100 of SEQ ID NO: 1, i.e. includes antisense nucleotides 
encoding the last seven amino acids of the gp-130 extracellular domain. 
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Oligonucleotide number 6 also encodes a Gl>'4Ser sequence immediately 3' to (and in 
phase with) the gpl30 sequence, and also insens a downstream BamHI site. 

A PCR reaction is conducted as described in Example 3 but employing 
oligonucleotides 5 and 6 on the gpl30 cDNA template. A DNA sequence encoding a 
5 gpl30 fragment that includes the 5' signal sequence and the entire extracellular domain, 
but none of the transmembrane region or the cytoplasmic domain, is isolated by the 
PCR reaction. A Gly4Ser sequence is fused to the 3' terminus of the gp 130 fragment. 
The PCR reaction products are digested with Sail and BamHI and the desired fragment 
is isolated. 

10 An LIF-R fragment is isolated and amplified by a PCR reaction employing 

oligonucleotides 7 and 8. The 5' primer (oligonucleotide number 7) includes 
nucleotides 31 1 to 331 of SEQ ID NO: 5, which encode the first seven amino acids of 
the mature LIF-R protein. This nucleotide sequence is capable of annealing to the (-) 
strand complementary to nucleotides 31 1 to 331 of SEQ ED NO: 5. Oligonucleotide 

15 number 7 also encodes a Gly4Ser sequence fused to the 5' end of the LIF-R sequence, 
and inserts an upstream BspMII site. The 3' primer (oligonucleotide number 8) is 
complementary to nucleotides 2651 to 2677 of SEQ ID NO: 5 (which encode the last 
nine amino acids of the LEF-R extracellular domain.) Oligonucleotide number 8 also 
adds a stop codon at the 3' end of *he LIF-R sequence, and insens a Not I site 

20 downstream. The PCR reaction pro ;ts are digested with BspMII and NotI and the 
desired fragment is isolated. 

A DNA sequence encoding the desired receptor protein is prepared by ligating 
the BamHI site of the gpl30 fragment prepared above to the BamHI site at the 5' 
terminus of the linker fragment described in Example 3. Likewise the C-terminus of 

25 the linker encoding fragment is ligated at the BspMII site to the complementary site of 
the LIF-R encoding fragment prepared above. The resulting DNA fragment may be 
cloned into an expression vector using procedures described in Example 3. The 
receptor encoded by the isolated DNA fragment comprises (from the N-terminus to the 
C-terminus) the signal sequence and extracellular domain of gpl30 attached to a 

30 (Gly4Ser)8 polypeptide linker which is attached to the mature coding sequence of the 
LEF-R extracellular domain. 
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Example 5 

Receptor Fusion Protein Comprising LIF-R Attached to gp-130 Through An Fc 

Polypeptide Linker 

5 

A receptor prepared in accordance with the following procedures is depicted in 
Figure 4. The following oligonucleotides were synthesized for use in preparing the 
receptor fusion protein: 
SEQIDNO: 16 

10 3' CATACATACACCACTGTTTCCTTTTAAGACTCGGGTCTAGATACG 5' (oligonucleotide 

no. 9) 

SEQIDNO: 17 

3' CAAACGAGTTCCTCTTTAACTTCTCGGGTCTAGATACG 5' (oligonucleotide no. 10) 

15 

An LEF-R encoding DNA sequence is isolated and amplified in a PCR reaction 
using oligonucleotides 1 and 9. Oligonucleotide number 1 (the 5' primer) inserts an 
upstream Sail site and has been described in Example 3. The 3' primer is 
oligonucleotide number 9 which includes a sequence complementary to nucleotides 

20 2651 to 2677 of SEQ ID NO: 5, i.e., includes antisense nucleotides encoding the last 
nine amino acids of the extracellular domain of LIF-R. Oligonucleotide number 9 also 
inserts a downstream Bglll site. The PCR reaction products are digested with Sail and 
Bglll, and the desired LIF-R encoding DNA fragment is isolated by gel electrophoresis 
using conventional procedures. Due to the presence of an internal Bglll site in the LIF- 

25 R sequence, the Bglll digestion should be carried out under conditions that effect partial 
digestion. 

A gpl30 encoding DNA fragment is isolated and amplified by PCR reaction 
using oligonucleotides 5 and 10. The 5' primer (oligonucleotide number 5) inserts an 
upstream Sail site and has been described above in Example 4. The 3' primer is 

30 oligonucleotide number 10, which includes a sequence complementary to nucleotides 
2080 to 2100 of SEQ ID NO: 1, i.e., includes antisense nucleotides encoding the last 
seven amino acids of the gpl30 extracellular domain. Oligonucleotide number 10 also 
inserts a downstream Bglll site. The PCR reaction products are digested with Sail and 
Bglll, and the desired gpl30 encoding DNA fragment is isolated by gel electrophoresis 

35 using conventional techniques. 

cDNA encoding a single chain polypeptide derived from the Fc region of a 
human IgGl antibody has been cloned into the above-described pBLUESCRIPT SK® 
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vector to produce a recombinant vector designated hlgGlFc. A unique Bglll site is 
positioned near the 5' end of the inserted Fc encoding sequence. An Spel site is 
immediately downstream of the stop codon. The DNA and encoded amino acid 
sequences of the cloned Fc cDNA are presented in SEQ ED NO: 3 and SEQ ED NO: 4. 
5 The Fc polypeptide encoded by the cDNA extends from the N-terminal hinge 

region to the native C-terminus, i.e., is an essentially full-length antibody Fc region. 
Fc fragments, e.g., those that are truncated at the C-terminal end, also may be 
employed. The fragments should contain multiple cysteine residues (at least the 
cysteine residues in the hinge reaction). The antibody from which the Fc polypeptide is 

10 derived is preferably of the same species as the patient to be treated with the fusion 
protein prepared therefrom.. 

Plasmid hlgGlFc is digested with Bglll and Sail and the Bglll/Sall LEF-R 
fragment prepared above is ligated into the vector by conventional techniques. The Fc 
encoding sequence is positioned downstream of, and in the same reading frame as, the 

15 LEF-R sequence. In a separate reaction, the above-described Sall/BglEI fragment of 
gpl30 is also inserted into the same vector. Plasmid vectors containing the desired 
DNA insert are identified by restriction endonuclease digestion analysis, using 
convention techniques. 

The cloned DNA segment encoding the LIF-R-Fc fusion polypeptide may be 

20 excised from the pBLUESCRIPT SK® vector by digestion with Sail and Notl. 
Likewise, the DNA segment encoding the gpl30-Fc fusion polypeptide may be excised 
by Sall/NotI digestion. Each excised DNA segment is inserted into an appropriate 
expression vector, depending on the type of host cell that is desired. One suitable 
expression vector is the plasmid pDC406, which may be transformed into mammalian 

25 host cells as described in Example 3. 

In one embodiment of the invention, an expression vector encoding the LIF-R- 
Fc fusion and a second expression vector encoding the gpl30-Fc fusion are co- 
transfected into the desired host cells. Two separate recombinant polypeptides are thus 
produced in the host cells. The first polypeptide comprises the Fc polypeptide fused in 

30 frame to the C-terminus of the gp 130 fragment. The second polypeptide comprises the 
Fc polypeptide fused in frame to the C-terminus of the LIF-R fragment. Disulfide 
bonds that form between the two Fc regions covalently link the two separate fusion 
polypeptides into a receptor protein of the present invention. 

Alternatively, the LIF-R-Fc and gp!30-Fc polypeptides may be separately 

35 transformed into host cells fas opposed to co-transfection into the same host cell.) The 
two polypeptides are purified from the host cells and then combined in a suitable 
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buffered solution, whereupon interchain disulfide bonds form between the two Fc 
regions. 

The receptor protein may be purified using any of a number of conventional 
protein purification techniques. Since antibody Fc regions bind to protein A and 
5 protein G, affinity chromatography employing protein A or protein G attached to an 
insoluble support material may be employed in the purification process. In one 
procedure, one liter of culture supernatant containing the receptor is passed over a solid 
phase protein G column, and the column is then washed thoroughly with phosphate- 
buffered saline (PBS). The adsorbed Fc-containing fusion protein is eluted with 50 
10 mM glycine buffer, pH 3 and brought to pH 7 with 2 M Tris buffer, pH 9. Further 
purification may involve immunoaffinity column(s), e.g., affinity columns having LIF 
or OSM bound thereto. 



Preparations of a purified receptor protein of the present invention, or 
transfected COS cells expressing high levels of the receptor, are employed to generate 
monoclonal antibodies against the receptor using conventional techniques, for example, 

20 those disclosed in U.S. Patent 4,411,993. To immunize mice, a receptor immunogen 
is emulsified in complete Freund's adjuvant and injected subcutaneously in amounts 
ranging from lO-lOOjag into Balb/c mice. Ten to twelve days later, the immunized 
animals are boosted with additional immunogen emulsified in incomplete Freund's 
adjuvant and periodically boosted thereafter on a weekly to biweekly immunization 

25 schedule. Serum samples are periodically taken by retro-orbital bleeding or tail-tip 
excision for testing by dot-blot assay (antibody sandwich) or ELISA (enzyme-linked 
immunosorbent assay). Other assay procedures are also suitable. Following detection 
of an appropriate antibody titer, positive animals are given an intravenous injection of 
antigen in saline. Three to four days later, the animals are sacrificed, splenocytes 

30 harvested, and fused to the murine myeloma cell line NS1. Hybridoma cell lines 

generated by this procedure are plated in multiple microtiter plates in a HAT selective 
medium (hypoxantine, aminopterin, and thymidine) to inhibit proliferation of non-fused 
cells, myeloma hybrids, and spleen cell hybrids. 



35 the receptor protein, for example, by adaptations of the techniques disclosed by Engvall 
et aL, Immunochem 8.811 (1971) and in U.S. Patent 4,704,004. Positive clones are 
then injected into the peritoneal cavities of syngeneic Balb/c mice to produce ascites 



15 



Example 6 

Preparation of Monoclonal Antibodies Directed against a Receptor 



Hybridoma clones thus generated can be screened by ELISA for reactivity with 
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containing high concentrations (greater than 1 mg/ml) of anti-receptor monoclonal 
antibody. The resulting monoclonal antibody can be purified by ammonium sulfate 
precipitation followed by gel exclusion chromatography, and/or affinity 
chromatography based on binding of antibody to Protein A of Staphylococcus aureus. 

Example 7 

Heterodimeric Receptor 



An expression vector encoding a fragment of the human LEF-R extracellular 
10 domain fused to a polypeptide derived from the Fc region of an antibody was 

constructed as follows. A second expression vector encoding a fragment of the human 
gpl30 extracellular domain fused to an Fc polypeptide also was constructed. 

Plasmid pHLIF-R-65 (ATCC 68491), which contains human LIF-R cDNA in 
expression vector pDC303 as described in example 3, was digested with the restriction 
15 enzymes Asp718 and Xmnl. Asp718 cleaves the vector upstream of the LEF-R cDNA 
insert. Xmnl is a blunt cutter that cleaves within the codon for amino acid number 702 
(Asp) of SEQ ID NO:5, upstream of the transmembrane region. The desired 
Asp718fXmnI fragment (about 2,444 bp in length) was separated by electrophoresis on 
an agarose gel and purified by conventional procedures using an Elutip column. 
20 A recombinant vector designated hlgGlFc, comprising cDNA encoding a single 

chain polypeptide derived from the Fc region of a human IgGl antibody in a 
pBLUESCRIPT SK® vector was described in example 5. The DNA and encoded 
amino acid sequences of the cloned Fc cDNA are presented in SEQ ID NO: 3 and SEQ 
ID NO:4. A polylinker region comprising a number of restriction sites is positioned 
25 immediately upstream of the Fc cDNA. 

Plasmid hlgGlFc was digested with Asp718 and StuI, which cleave within the 
pollylinker upstream of the Fc sequence. The Asp718PCmnl LIF-R fragment prepared 
above was ligated into the cleaved hlgGlFc vector by conventional techniques. Stul 
and Xmnl both produce blunt ends, which will iigate together. In the resulting 
30 recombinant vector, the Fc encoding sequence is positioned downstream of, and in the 
same reading frame as, the LEF-R sequence. The encoded LIF-R/Fc fusion protein 
comprises amino acids -44 to 702 of SEQ ID NO:5 (LEF-R), followed by six amino 
acids constituting a peptide linker encoded by the polylinker segment of plasmid 
hlgGlFc, followed by amino acids 1-232 of SEQ ID NO:3 (Fc). E. coli cells were 
35 transformed with the ligation mixture and plasmids were isolated therefrom by standard 
procedures. Plasmid vectors containing the desired DNA insert were identified by 
restriction endonuclease digestion analysis. 
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The cloned DNA segment encoding the LEF-R/Fc fusion polypeptide was 
excised from the recombinant vector by digestion with Asp718 and Notl. The Notl 
enzyme cleaves the vector in a polylinker region just downstream of the Fc cDNA 
insert The excised DNA segment (3.2 kb) is inserted into an appropriate expression 
5 vector, depending on the type of host cell that is desired. One suitable expression 
vector is pCA V/NOT, a mammalian expression vector described in PCT application 
WO 90/05183. 

pCA V/NOT was cleaved with Asp7I8 and Notl, both of which cleave in the 
multiple cloning site. The LIF-R/Fc-encoding Asp718fNotI DNA fragment prepared 

10 above was ligated into the vector. 

An expression vector encoding a soluble gpl30/Fc fusion protein was 
constructed as follows. Recombinant vector B10G/pDC303 (ATCC 68827) 
comprising human gpl30 cDNA (described in example 3) was digested with EcoRl, 
and the resulting 5' overhang was rendered blunt using T4 DNA polymerase. The 

15 recognition site for EcoRl comprises nucleotides 2056-2061 of SEQ ID NO:L The 
EcoRl -digested vector was then cleaved with Xhol, which cleaves in the vector 
upstream of the gpl30 cDNA insert . 

Plasmid hlgGlFc, comprising Fc polypeptide-encoding cDNA as described 
above, was digested with StuI (a blunt cutter) and Notl, which cleave upstream and 

20 downstream, respectively, of the inserted Fc cDNA. The (EcoRl )/XhoI gpl30 
fragment isolated above was ligated to the Fc-containing fragment and to Xhol/Notl- 
digested vector SF CA V/NOT. The mammalian expression vector SF CA V/NOT is 
essentially identical to SF CAV (ATCC 68922) but contains a Notl site. SF CA V/NOT 
also is essentially identical to pCA V/NOT, described in PCT application WO 90/05183, 

25 except that a segment of the adenovirus-2 tripartite leader (TPL) containing a cryptic 
promoter functional in bacteria has been deleted. Protein expression from the cryptic 
promoter is potentially disadvantageous for preparing and isolating a desired 
recombinant plasmid in bacterial cells. 

£. coli cells were transformed with the ligation mixture, plasmids were isolated 

30 therefrom by conventional procedures, and the desired recombinant plasmids were 
identified by restriction analysis. The gpl30/Fc fusion protein encoded by the desired 
recombinant vector comprises (from N- to C-terminus) amino acids -22 to 582 of SEQ 
ID NO:2 (gpl30), followed by 7 amino acids constituting a peptide linker encoded by 
the polylinker segment of plasmid hlgGlFc, followed by amino acids 1-232 of SEQ ID 

35 NO:4 (Fc). 

COS-7 cells (ATCC CRL 1651) were transfected with either the LIF-R/Fc- 
encoding recombinant expression vector or the gpl 30/Fc-encoding expression \ector 
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prepared above, or with both expression vectors. The cells were cultivated to allow 
expression of the soluble fusion proteins. The expressed proteins were recovered by 
incubating culture supernatant with Protein G Sepharose beads (available from 
Pharmacia) overnight at 4°C, then pelleting the beads by centrifugation. The binding of 
5 125 I-labeled human oncostatin M and 125 I-labeled human LIF by the proteins bound to 
the beads was analyzed. 

Binding affinity was determined by performing a variation of a standard 
Scatchard analysis. The binding assay procedure was similar to that described by 
Mosley et al. (Cell 59:335, 1989) except that the proteins, being soluble, are attached to 

10 Protein G Sepharose beads rather than being on the surface of the transfected cells. 
Briefly, in a 96-well microliter plate, each of ten 1:2 serial dilutions of 125 I-LEF or 125 I- 
oncostatin M was incubated with a sample comprising the expressed proteins (bound to 
the beads) resuspended in RPMI 1640 containing 2.5% bovine serum albumin, 0.29c 
(v/v) sodium azide and 20mM Hepes, pH 7.4, for 2 hours at 4°C with agitation. 

15 Duplicate standard cold competition wells also were incubated. Centrifuge tubes 
containing bovine calf serum were used in place of the phthalate oil mixture-containing 
tubes in the separation method described by Dower et al., J. Immunol. 132:751 (1984) 
and Park et al., J. Biol Chem. 261:4177 (1986) and in example 1 above. Aliquots of 
each incubation mixture were transferred to the tubes. After centrifugation, tubes were 

20 cut, the radioactivity counted, and processed as for standard Scatchard analysis. 

Figure 7 presents Scatchard analyses of the binding of 125 I oncostatin M by 
gpl30/Fc homodimers produced by the cells transfected with the gpl30/Fc vector alone 
(upper left) and by the proteins expressed by the co-transfected cells (lower left). 
Scatchard analyses of the binding of 125 I LIF by LIF-R/Fc homodimers produced by 

25 the cells transfected with the LIF-R/Fc vector alone (upper right) and by the proteins 
expressed by the co-transfected cells (lower right) are also presented in figure 7. A 
shift toward higher affinity binding of oncostatin M by the proteins recovered from the 
co-transfected cells, compared to the gpl30/Fc homodimer, is evident from figure 7. 
Likewise, the data in figure 7 indicate a shift toward higher affinity binding of LIF by 

30 the proteins recovered from the co-transfected cells, compared to the LIF-R/Fc 
homodimer. The shift toward higher affinity binding indicates the presence of 
heterodimers comprising LIF-R/Fc and gpl30/Fc, and further indicates that the LIF-R 
and gpl 30 moieties are cooperating, i.e., interacting, in the binding of oncostatin M 
and LIF. Controls demonstrated no oncostatin M binding by LIF-R homodimers. and 

35 no LIF binding by gp 1 30 homodimers. 
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Example 8 

Receptors Comprising LIF-R and gp!30 Polypeptides Lacking FNITI Domains 

DNA sequences encoding soluble LIF-R and gpl30 proteins lacking fibronectin 
5 type EI (FNH[) domains were isolated and fused to an Fc -encoding sequence. Deleting 
the FNIII domains affords the advantage of reducing the size of the LEF-R/Fc and 
gpl30/Fc fusion proteins. The LIF-R protein of SEQ ID NO:6 comprises three repeats 
of a fibronectin type El-like module in the extracellular domain. The three domains 
containing FNIII modules comprise amino acids 487 (Thr) to 584 (Asn), 585 (Asp) to 
10 679 (Ala), and 680 (Pro) to 789 (Ser), respectively, of SEQ ID NO:6. gpl30 also 
contains three FNIII domains, comprising amino acids 300 (Tyr) to 399 (Phe), 400 
(Gin) to 496 (Pro), and 497 (Pro) to 597 (Glu), respectively, of SEQ ID NO:2. From 
one to all three of the FNIII domains may be removed from gpl30 or LEF-R to reduce 
the size of the protein. 

15 The FNIII domains of human LEF-R were removed by digesting the LIF-R/Fc- 

encoding expression vector prepared in example 7 with the restriction endonuclease Eco 
O 1091 (isoschizomer of Dra II) and filling in the resulting overhangs using T4 DNA 
polymerase according to conventional procedures. The recognition site for Eco O 1091 
spans nucleotides 1789-1795 of SEQ ID NO:5 (LIF-R), cleaving within the codons for 

20 amino acids 8-9 of the first FNIII domain of LIF-R. The cleaved vector was then 
digested with BstXl and EcoR5. The recognition site for BstXl spans nucleotides 
1048-1059 of SEQ ID NO:5 and EcoR5 (which generates blunt ends) cleaves within 
the polylinker upstream of the Fc sequence. The BstXl/EcoR5 fragment (comprising 
the 5' end of LIF-R, the vector, the entire Fc sequence, and a portion of the polylinker) 

25 and the BstXl/(Eco O 1091) LIF-R fragment were isolated and ligated together. £. coli 
cells were transformed with the ligation mixture, plasmids were isolated therefrom, and 
the desired recombinant plasmid was identified by restriction analysis. The resulting 
construct encodes a fusion protein comprising (from N- to C-terminus) amino acids -44 
to 494 of SEQ ID NO:5 (LIF-R), a four amino acid spacer peptide -His-Arg-Tyr-Val- 

30 encoded by the polylinker segment, and amino acids 1-232 of SEQ ED NO:3 (Fc). The 
LIF-R polypeptide moiety contains the first 8 amino acids of the first FNIII domain, 
but lacks the remainder of the first FNIII domain and all of the second and third FNIII 
domains. 

The FNIII domains of gpl30 were removed by digesting the recombinant 
35 gpl30/Fc-encoding expression vector prepared in example 7 with BstXl, then blunting 
the overhang using T4 DNA polymerase according to conventional procedures. The 
recognition site for BstXl spans nucleotides 1231-1242 of SEQ ID NO:l (gpl30). 
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cleaving within the codons for amino acids 10-11 of the first FNIII domain of gpl30. 
The cleaved vector was then digested with EcoR5, which cleaves within the polylinker 
upstream of the Fc sequence and generates blunt ends. The (BstXl)/EcoR5 fragment 
comprising the 5' end of gpl30 (lacking the FNIII domains), the vector sequences, the 
5 Fc sequence, and a portion of the polylinker, was ligated. £. coli cells were 
transformed with the ligation mixture, plasmids were isolated therefrom, and the 
desired recombinant plasmid was identified by restriction analysis. The fusion protein 
encoded by the construct comprises (from N- to C-terminus) amino acids -22 to 308 of 
SEQ ID NO:2 (gpl30), a four amino acid spacer peptide -Asn-Arg-Tyr-Val- encoded 

10 by the polylinker segment, and amino acids 1-232 of SEQ ID NO:3 (Fc). The gpl30 
polypeptide moiety contains the first 9 amino acids of the first FNIII domain, but lacks 
the remainder of the first FNIII domain and all of the second and third FNIII domains. 

FNIII domains may be deleted from the gpl30 component of a receptor of the 
present invention, from the LEF-R component, or from both. In one embodiment of the 

15 invention, COS-7 cells were co-transfected with the soluble LIF-R/Fc-encoding 
mammalian expression vector prepared in example 7 and the mammalian expression 
vector encoding a soluble gpl30/Fc protein lacking the FNIII domains prepared above. 
Analysis of the expressed proteins by SDS-PAGE revealed a band of the molecular 
weight expected for the heterodimer, along with bands that include those of the 

20 molecular weight expected for the two homodimers. Scatchard analyses conducted 
according to the procedures described in example 7 demonstrated a shift toward higher 
affinity binding of LIF and oncostatin M for proteins expressed by the co-transfected 
cells compared to the corresponding homodimers. This result indicates the presence of 
heterodimers comprising LIF-R/Fc and gpl30/Fc, and further indicates that the LIF-R 

25 and gpl30 moieties are cooperating, i.e., interacting, in binding oncostatin M and LIF. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

SEQ ED NO:l and SEQ ID NO:2 present the DNA sequence and encoded amino acid 
30 sequence for cloned cDNA encoding an N-terminal fragment of gp 1 30. 

SEQ ID NO:3 and SEQ ID NO:4 present the DNA sequence and encoded amino acid 
sequence for cloned cDNA encoding a polypeptide that corresponds to the Fc region of 
an IgGl antibody. 

35 

SEQ ED NO:5 and SEQ ED NO:6 present the DNA sequence and encoded amino acid 
sequence for cloned cDNA encoding an N-terminal fragment of LEF-R. 
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SEQ ED NO:7 presents the DNA sequence of the coding strand of a chemically 
synthesized DNA molecule encoding a polypeptide linker used in constructing certain 
receptors of the present invention. 

SEQ ID NO:8 - SEQ ID NO: 17 present the DNA sequence of various single-stranded 
oligonucleotide primers employed in polymerase chain reactions to construct certain 
receptors of the present invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gearing, David F . 
(ii) TITLE OF INVENTION : Receptor for Onccstatir. M and Leukemia 



(iii) NUM3ER OF SEQUENCES: 17 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Imraunex Corporation 

(B) STREET : 51 University Street 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 981CI 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii } ATTORNEY /AGENT INFORMATION : 

(A) NAME: Seese, Kathryn A. 

(B) REGISTRATION NUMBER: 32,172 

(C) REFERENCE / DOCKET NUMBER: 2607 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-587-0430 

(B) TELEFAX: 206-587-0606 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NC 

(v) FRAGMENT TYPE: N- 1 e rr.ina 1 

(vi) ORIGINAL SOURCE: 



Inhibitory Factor 
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(B) CLONE: B10G/pDC303 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 244.. 2369 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 310.. 2369 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 244 . . 309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



GGCCCGCGGA GTCGCGCTGG GCCGCCCCGG CGCAGCTGAA CCGGGGGCCG CGCCTGCCAG 6 0 

GCCGACGGGT CTGGCCCAGC CTGGCGCCAA GGGGTTCGTG CGCTGTGGAG ACGCGGAGGG 120 

TCGAGGCGGC GCGGCCTGAG TGAAACCCAA TGGAAAAAGC ATGACATTTA GAAGTAGAAG 180 

ACTTAGCTTC AAATCCCTAC TCCTTCACTT ACTAATTTTG TGATTTGGAA ATATCCGCGC 24 0 

AAG ATG TTG ACG TTG CAG ACT TGG CTA GTG CAA GCC TTG TTT ATT TTC 266 
Met Leu Thr Leu Gin Thr Trp Leu Val Gin Ala Leu Phe lie Phe 
-22 -20 -IS -10 

CTC ACC ACT GAA TCT ACA GGT GAA CTT CTA GAT CCA TGT GGT TAT ATC 336 
Leu Thr Thr Glu Ser Thr Gly Glu Leu Leu Asp Pro Cys Gly Tyr lie 
-5 15 

AGT CCT GAA TCT CCA GTT GTA CAA CTT CAT TCT AAT TTC ACT GCA GTT 384 
Ser Pro Glu Ser Pro Val Val Gin Leu His Ser Asn Phe Thr Ala Val 
10 15 20 25 

TGT GTG CTA AAG GAA AAA TGT ATG GAT TAT TTT CAT GTA AAT GCT AAT 4 32 

Cys Val Leu Lys Glu Lys Cys Met Asp Tyr Phe His Val Asn Ala Asn 
30 35 40 

TAC ATT GTC TGG AAA ACA AAC CAT TTT ACT ATT CCT AAG GAG CAA TAT 4 80 

Tyr lie Val Trp Lys Thr Asn His Phe Thr lie Pro Lys Glu Gin Tyr 
45 50 55 

ACT ATC ATA AAC AGA ACA GCA TCC AGT GTC ACC TTT ACA GAT ATA GCT 52 8 

Thr lie lie Asn Arg Thr Ala Ser Ser Val Thr Phe Thr Asp He Ala 
60 65 70 

TCA TTA AAT ATT CAG CTC ACT TGC AAC ATT CTT ACA TTC GGA CAG CTT 57 6 

Ser Leu Asn He Gin Leu Thr Cys Asn He Leu Thr Phe Gly Gin Leu 

75 80 85 

GAA CAG AAT GTT TAT GGA ATC ACA ATA ATT TCA GGC TTG CCT CCA GAA 62 4 

Glu Gin Asn Val Tyr Gly He Thr He He Ser Gly Leu Pro Pre Glu 
90 95 100 1C5 

AAA CCT AAA AAT TTG AGT TGC ATT GTG AAC GAG GGG AAG AAA ATG AGG 6~I 
Lys Pro Lys Asn Leu Ser Cys He Val Asn Glu Gly Lys Lys Met Arg 
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TGT GAG TGG GAT GGT GGA AG3 GAA ACA CAC TTG GAG ACA AAC TTC ACT 7 2 Z 

Cys Giu Trp Asp Gly Giy Arg Glu Thr His Leu Giu Thr A sr. Phe Tnr 
125 13 0 ~ ~ z 

TTA AAA TCT GAA TGG GCA ACA CAC AAG TTT GCT GAT TGC AAA GCA AAA 76 8 

Leu Lys Ser Glu Trp Ala Thr His Lys Phe Ala Asp Cys Lys Ala Lys 

14 0 * 145 * 150 ^ 

CGT GAC ACC CCC ACC TCA TGC ACT GTT GAT TAT TCT ACT GTG TAT TTT 816 

Arg Asp Thr Pro Thr Ser Cys Thr Val Asp Tyr Ser Thr Vai Tyr Phe 

155 160 * " 165 

GTC AAC ATT GAA GTC TGG GTA GAA GCA GAG AAT GCC CTT GGG AAG GTT 9 64 

Val Asn lie Glu Val Trp Val Glu Ala Glu Asn Ala Leu Giy Lys Val 

170 175 180 ~ 185 

ACA TCA GAT CAT ATC AAT TTT GAT CCT GTA TAT AAA GTG AAG CCC AAT 912 

Thr Ser Asp His lie Asn Phe Asp Pro Vai Tyr Lys Val Lys Pro Asr. 

19C 195 * 200 

CCG CCA CAT AAT TTA TCA GTG ATC AAC TCA GAG GAA CTG TCT AGT ATC 96 0 

Pro Pro His Asn Leu Ser Vai lie Asn Ser Glu Giu Leu Ser Ser lie 
205 210 215 

TTA AAA TTG ACA TGG ACC AAC CCA AGT ATT AAG AGT GTT ATA ATA CTA 1006 

Leu Lys Leu Thr Trp Thr Asn Pro Ser lie Lys Ser Vai lie lie Leu 

220 225 " 230 

AAA TAT AAC ATT CAA TAT AGG ACC AAA GAT GCC TCA ACT TGG AGC CAG 10 56 

Lys Tyr Asn lie Gin Tyr Arc Thr Lys Asp Ala Ser Tnr Trp Ser Gin 

235 240 "* 245 

ATT CCT CCT GAA GAC ACA GCA TCC ACC CGA TCT TCA TTC ACT GTC CAA 1104 

lie Pro Pro Glu Asp Thr Ala Ser Thr Arg Ser Ser Phe Thr Val Gin 

250 255 260 265 

GAC CTT AAA CCT TTT ACA GAA TAT GTG TTT AGG ATT CGC TGT ATG AAG 1152 

Asp Leu Lys Pro Phe Thr Glu Tyr Val Phe Arg lie Arg Cys Met Lys 

270 * 275 " 280 

GAA GAT GGT AAG GGA TAC TGG AGT GAC TGG AGT GAA GAA GCA AGT GGG 120 0 

Glu Asp Gly Lys Gly Tyr Trp Ser Asp Trp Ser Giu Glu Ala Ser Giy 
285 290 295 

ATC ACC TAT GAA GAT AGA CCA TCT AAA GCA CCA AGT TTC TGG TAT AAA 12 4 6 

He Thr Tyr Glu Asp Arg Pro Ser Lys Ala Pro Ser Phe Trp Tyr Lys 

300 305 310 

ATA GAT CCA TCC CAT ACT CAA GGC TAC AGA ACT GTA CAA CTC GTG TGG 12 96 

He Asp Pre Ser His Thr Gin Gly Tyr Arg Thr Val Gin Leu Val Trp 

315 320 " 325 

AAG ACA TTG CCT CCT TTT GAA GCC AAT GGA AAA ATC TTG GAT TAT GAA . j 4 4 

Lys Thr Leu Pro Pro Phe Glu Ala Asn Gly Lys He Leu Asp Tyr Glu 

330 335 " 3 4 C * " 34 5 

GTG ACT CTC ACA AGA TGG AAA TCA CAT TTA CAA AAT TAC ACA GTT AA. T 13 9: 

Val Thr Leu Tnr Arc Trp Lys Ser His Leu Gin Asn Tvr Tnr Val Asn 
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GCC ACA AAA CTG ACA GTA AAT CTC ACA AAT GAT CGC TAT CTA GCA ACC 144 C 

Ala Thr Lys Leu Thr Val Asn Leu Thr Asn Asp Arg Tyr Leu Ala Thr 
365 370 375 

CTA ACA GTA AGA AAT CTT GTT GGC AAA TCA GAT GCA GCT GTT TTA ACT 14 88 

Leu Thr Val Arg Asn Leu Val Gly Lys Ser Asp Ala Ala Val Leu Thr 
380 385 390 

ATC CCT GCC TGT GAC TTT CAA GCT ACT CAC CCT GTA ATG GAT CTT AAA 1536 
lie Pro Ala Cys Asp Phe Gin Ala Thr His Pro Val Met Asp Leu Lys 
395 400 405 

GCA TTC CCC AAA GAT AAC ATG CTT TGG GTG GAA TGG ACT ACT CCA AGG 1584 
Ala Phe Pro Lys Asp Asn Met Leu Trp Val Glu Trp Thr Thr Pro Arg 
410 415 420 425 

GAA TCT GTA AAG AAA TAT ATA CTT GAG TGG TGT GTG TTA TCA GAT AAA 1632 
Glu Ser Val Lys Lys Tyr lie Leu Glu Trp Cys Val Leu Ser Asp Lys 
430 435 440 

GCA CCC TGT ATC ACA GAC TGG CAA CAA GAA GAT GGT ACC GTG CAT CGC 1680 
Ala Pro Cys lie Thr Asp Trp Gin Gin Glu Asp Gly Thr Val His Arg 
445 450 455 

ACC TAT TTA AGA GGG AAC TTA GCA GAG AGC AAA TGC TAT TTG ATA ACA 172 8 

Thr Tyr Leu Arg Gly Asn Leu Ala Glu Ser Lys Cys Tyr Leu lie Thr 
460 465 470 

GTT ACT CCA GTA TAT GCT GAT GGA CCA GGA AGC CCT GAA TCC ATA AAG 177 6 

Val Thr Pro Val Tyr Ala Asp Gly Pro Gly Ser Pro Glu Ser lie Lys 
475 480 485 

GCA TAC CTT AAA CAA GCT CCA CCT TCC AAA GGA CCT ACT GTT CGG ACA 1824 
Ala Tyr Leu Lys Gin Ala Pro Pro Ser Lys Gly Pro Thr Val Arg Thr 
490 495 500 505 

AAA AAA GTA GGG AAA AAC GAA GCT GTC TTA GAG TGG GAC CAA CTT CCT 187 2 

Lys Lys Val Gly Lys Asn Glu Ala Val Leu Glu Trp Asp Gin Leu Pro 
510 515 520 

GTT GAT GTT CAG AAT GGA TTT ATC AGA AAT TAT ACT ATA TTT TAT AGA 192 0 

Val Asp Val Gin Asn Gly Phe lie Arg Asn Tyr Thr lie Phe Tyr Arg 
525 530 535 

ACC ATC ATT GGA AAT GAA ACT GCT GTG AAT GTG GAT TCT TCC CAC ACA 1968 
Thr lie lie Gly Asn Glu Thr Ala Val Asn Val Asp Ser Ser His Thr 
540 545 550 

GAA TAT ACA TTG TCC TCT TTG ACT AGT GAC ACA TTG TAC ATG GTA CGA 2016 
Glu Tyr Thr Leu Ser Ser Leu Thr Ser Asp Thr Leu Tyr Met Val Arg 
555 560 565 

ATG GCA GCA TAC ACA GAT GAA GGT GGG AAG GAT GGT CCA GAA TTC ACT 2 064 

Met Ala Ala Tyr Thr Asp Glu Gly Gly Lys Asp Gly Pro Glu Phe Thr 
570 575 580 585 

TTT ACT ACC CCA AAG TTT GCT CAA GGA GAA ATT GAA GCC ATA GTC GTG 2112 
Phe Thr Thr Pro Lys Phe Ala Gin Gly Glu lie Glu Ala He Val Val 

59C 595 600 
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CCT GTT TGC TTA GCA TTC CTA TTG ACA ACT CTT CTG GGA GTG CTG TTC 216C 

Pro Val Cys Leu Ala Phe Leu Leu Thr Thr Leu Leu Gly Val Leu Phe 
605 610 €15 

TGC TTT AAT AAG CGA GAC CTA ATT AAA AAA CAC ATC TGG CCT AAT GTT 22 CE 

Cys Phe Asn Lys Arg Asp Leu lie Lys Lys His lie Trp Pro Asn Val 
620 625 " 630 

CCA GAT CCT TCA AAG ACT CAT ATT GCC CAG TGG TCA CCT CAC ACT CCT 2256 

Pro Asp Pro Ser Lys Ser Hxs lie Ala Gin Trp Ser Pro His Thr Pro 

635 640 645 

CCA AGG CAC AAT TTT AAT TCA AAA GAT CAA ATG TAT TCA GAT GGC AAT 2 3C4 

Pro Arg His Asn Phe Asn Ser Lys Asp Gin Met Tyr Ser Asp Gly Asn 
650 655 ~ * 660 " 665 

TTC ACT GAT GTA AGT GTT GTG GAA ATA GAA GCA AAT GAC AAA AAG CCT 2352 

Phe Thr Asp Val Ser Val Val Glu lie Glu Ala Asn Asp Lys Lys Pro 
670 675 680 

TTT CCA GAA GAT CTG AA 2 36 9 

Phe Pro Giu Asp Leu 
685 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Leu Gin Thr Trp Leu Val Gin Ala Leu Phe lie Phe Leu 
-22 -20 -15 -10 

Thr Thr Glu Ser Thr Gly Glu Leu Leu Asp Pro Cys Gly Tyr lie Ser 
-5 15 10 

Pro Glu Ser Pro Val Vai Gin Leu His Ser Asn Phe Thr Ala Val Cys 
15 20 25 

Val Leu Lys Glu Lys Cys Met Asp Tyr Phe His Val Asn Ala Asn Tyr 
30 35 40 

lie Val Trp Lys Thr Asn His Phe Thr lie Pro Lys Glu Gin Tyr Thr 
45 50 ' 55 

lie lie Asn Arg Thr Ala Ser Ser Val Thr Phe Thr Asp lie Ala Ser 
6 0 6 5 i: 

Leu Asn He Gin Leu Thr Cys Asn He Leu Thr Phe Gly Gin Leu Glu 

7 5 8 0 8 5 9C 

Gin Asn Vai Tvr Glv lie Tnr lie He Ser 31 v Leu Pre Pre Glu Lv s 
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Pro Lys Asn Leu Ser Cys He Val Asn Glu Gly Lys Lys Met Arg Cys 
110 115 120 

Glu Trp Asp Gly Gly Arg Glu Thr His Leu Glu Thr Asn Phe Thr Leu 
125 130 135 

Lys Ser Glu Trp Ala Thr His Lys Phe Ala Asp Cys Lys Ala Lys Arg 
140 145 150 

Asp Thr Pro Thr Ser Cys Thr Val Asp Tyr Ser Thr Val Tyr Phe Val 
155 160 165 170 

Asn He Glu Val Trp Val Glu Ala Glu Asn Ala Leu Gly Lys Val Thr 
175 180 185 

Ser Asp His He Asn Phe Asp Pro Val Tyr Lys Val Lys Pro Asn Pro 
190 195 200 

Pro His Asn Leu Ser Val He Asn Ser Glu Glu Leu Ser Ser He Leu 
205 210 215 

Lys Leu Thr Trp Thr Asn Pro Ser He Lys Ser Val He He Leu Lys 
220 225 230 

Tyr Asn He Gin Tyr Arg Thr Lys Asp Ala Ser Thr Trp Ser Gin He 
235 240 245 250 

Pro Pro Glu Asp Thr Ala Ser Thr Arg Ser Ser Phe Thr Val Gin Asp 
255 260 265 

Leu Lys Pro Phe Thr Glu Tyr Val Phe Arg lie Arg Cys Met Lys Glu 
270 275 280 

Asp Gly Lys Gly Tyr Trp Ser Asp Trp Ser Glu Glu Ala Ser Gly He 
285 290 295 

Thr Tyr Glu Asp Arg Pro Ser Lys Ala Pro Ser Phe Trp Tyr Lys He 
300 305 310 

Asp Pro Ser His Thr Gin Gly Tyr Arg Thr Val Gin Leu Val Trp Lys 
315 320 325 330 

Thr Leu Pro Pro Phe Glu Ala Asn Gly Lys He Leu Asp Tyr Glu Val 
335 340 345 

Thr Leu Thr Arg Trp Lys Ser His Leu Gin Asn Tyr Thr Val Asn Ala 
350 355 360 

Thr Lys Leu Thr Val Asn Leu Thr Asn Asp Arg Tyr Leu Ala Thr Leu 
365 370 375 

Thr Val Arg Asn Leu Val Gly Lys Ser Asp Ala Ala Val Leu Thr He 
360 385 " 390 

Pro Ala Cys Asp Phe Gin Ala Thr Has Pro Val Met Asp Leu Lys Ala 
395 400 405 41C 

Phe Pre Lys Asp Asn Met Leu Trp Val Glu Trp Thr Thr Pro Arg Glu 



415 



42C 



425 
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Ser Val Lys Lys Tyr lie Leu Glu Trp Cys Val Leu Se r Asp Lys Ala 
4 30 4 3*5 44 C 

Pro Cys lie Thr Asp Trp Glr. Gin Glu Asp Gly Tnr Val His Arc Thr 
445 * 45C 455 

Tyr Leu Arg Gly Asn Leu Ala Glu Ser Lys Cys Tyr Leu He Thr Val 
460 465 470 

Thr Pro Val Tyr Ala Asp Gly Pro Gly Ser Pro Glu Ser He Lys Ala 
475 480 485 490 

Tyr Leu Lvs Gin Ala Pro Pro Ser Lys Gly Pro Thr Val Arg Thr Lys 
495 50C 505 

Lys Val Gly Lys Asn Glu Ala Val Leu Glu Trp Asp Gin Leu Pro Val 
510 515 520 

Asp Val Gin Asn Gly Phe He Arg Asn Tyr Thr He Phe Tyr Arg Thr 
525 530 535 

He He Gly Asn Glu Thr Ala Val Asn Val Asp Ser Ser His Thr Glu 
540 545 55C 

Tyr Thr Leu Ser Ser Leu Thr Ser Asp Thr Leu Tyr Met Val Arg Met 
555 560 565 570 

Ala Ala Tyr Thr Asp Glu Gly Gly Lys Asp Gly Pro Glu Phe Thr Phe 
575 580 585 

Thr Thr Pro Lys Phe Ala Gin Gly Glu He Glu Ala He Val Val Pro 
590 595 600 

Val Cys Leu Ala Phe Leu Leu Thr Thr Leu Leu Gly Val Leu Phe Cys 
605 610 615 

Phe Asn Lys Arg Asp Leu He Lys Lys His He Trp Pro Asn Val Pro 
620 625 630 

Asp Pro Ser Lys Ser His lie Ala Gin Trp Ser Pro His Thr Pro Pro 
635 640 645 650 

Arg His Asn Phe Asn Ser Lys Asp Gin Met Tyr Ser Asp Gly Asn Phe 
655 * 660 665 

Thr Asp Val Ser Val Val Glu He Glu Ala Asn Asp Lys Lys Pro Phe 
670 675 680 

Pro Glu Asp Leu 
685 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 705 case pairs 
(E) TYPE : nucleic acid 

(C) STRANDEDNES3 : single 

(D) TCPOLCGY: linear 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: hlgGIFc 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: I . .699 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAG CCC AGA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA 4 6 

Glu Pro Arg Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
15 10 15 

CCT GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC 96 
Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
20 25 30 

AAG GAC ACC CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG 144 
Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 
35 40 45 

GTG GAC GTG AGC CAC GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG 192 
Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
50 55 60 

GAC GGC GTG GAG GTG CAT AAT GCC AAG ACA AAG CCG CGG GAG GAG CAG 24 0 

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
65 70 75 80 

TAC AAC AGC ACG TAC CGG GTG GTC AGC GTC CTC ACC GTC CTG CAC CAG 288 
Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin 
85 90 95 

GAC TGG CTG AAT GGC AAG GAC TAC AAG TGC AAG GTC TCC AAC AAA GCC 336 
Asp Trp Leu Asn Gly Lys Asp Tyr Lys Cys Lys Val Ser Asn Lys Ala 
100 105 110 

CTC CCA GCC CCC ATG CAG AAA ACC ATC TCC AAA GCC AAA GGG CAG CCC 3 84 

Leu Pro Ala Pro Met Gin Lys Thr lie Ser Lys Ala Lys Gly Gin Pro 
115 120 125 

CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG ACC 4 32 

Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr 
130 135 140 

AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGG 4 8C 

Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Arc 
145 150 155 160 

CAC ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC 52 9 

His lie Ala Val Glu Tro Glu Ser Asn Gly Gin Pro Glu Asn Asn Tvr 
165 17 C 175 
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AAG ACC ACG CCT CCC GTG CTG GAC TCC GAG GGC TCC TTC TTC CTC TAG 57 £ 

Lys Thr Thr Pro Pre Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 

1BC :65 190 

AGC AAG CTG ACC GTG GAG AAG AGC AGG TGG GAG GAG GGG AAG GTG TTG 62 4 

Ser Lys Leu Thr Val Asp Lys Ser Arc Trp Gin Gin Gly A sr. Val Phe 

195 " 20C " 205 

TGA TGG TCG GTG ATG CAT GAG GGT CTG CAC AAG CAC TAG AGG GAG AAG €72 

Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 

210 215 22C 

AGC CTC TCC CTG TCT CCG GGT AAA TGAACTAGT 7C5 

Ser Leu Ser Leu Ser Pro Gly Lys 

225 230 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Glu Pro Arg Ser Cys Asp Lys Thr His Thr Cys Pro Pre Cys Pro Ala 
1 5 10 15 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pre 
20 " 25 3C 

Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 
35 40 45 

Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
50 5*5 - 60 

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
65 70 75 80 

Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin 
85 90 95 

Asp Trp Leu Asn Gly Lys Asp Tyr Lys Cys Lys Val Ser Asn Lys Ala 
100 " 105 11C 

Leu Pro Ala Pro Met Gin Lys Thr lie Ser Lys Ala Lys Gly Gin Pre 
115 12C 125 

Arg Glu Pro Gin Val Tyr Thr Leu Pre Pro Ser Arg Asp Glu Leu Tr.r 
130 135 140 



Lys Asn Gin Val Ser Leu Tr.r Cys Leu Val Lys Gly Pne Tyr Pre Arc 
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Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
18C 185 190 

Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
195 200 205 

Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
210 215 220 

Ser Leu Ser Leu Ser Pre Gly Lys 
225 230 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 3182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: human placenta 

(vii) IMMEDIATE SOURCE: 

<B) CLONE: pHLIFR-65 

fix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 311.. 3182 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 179.. 3182 

(ix) FEATURE: 

(A) NAME /KEY : sigjpeptide 

(B) LOCATION: 179~T. 310 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 311 . . 3182 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
AGATCTTGGA ACGAGACGAC CTGCTCTCTC TCCCAGAACG TGTCTCTGCT GCAAGGCACC 6C 
GGGCCCTTTC GCTCTGCAGA ACTGCACTTG CAAGACCATT ATCAACTCCT AATCCCA.2CT 12 C 



CAGAAAGGGA GCCT — TGCoA ^TCAT*CATC G^^^T^^AGo A^TGAC* o^A 
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ATG ATG GAT ATT TAG GTA TGT TTG AAA GGA GGA TCC TGG ATG GTG GAC 226 

Met Met Asp lie Tyr Vai Cys Leu Lys Arg Pre Ser Trp Met Val Asp 

-44 ' -40 -35 -3C 

AAT AAA AGA ATG AGG ACT GCT TCA AAT TTC CAG TGG CTG TTA TCA AGA 2 74 

Asn Lys Arg Met Arg Thr Ala Ser A sr. Phe Gin Trp Leu Leu Ser Thr 
-25 -20 -15 

TTT ATT CTT CTA TAT CTA ATG AA? CAA GTA AAT AGG CAG AAA AAG GGG 322 
Phe lie Leu Leu Tyr Leu Met Asn Gin Val Asn Ser Gin Lys Lys Gly 

-io -5 : 

GCT CCT CAT GAT TTG AAG TGT GTA ACT AAC AAT TTG CAA GTG TGG AAC 37 C 

Ala Pro His Asp Leu Lys Cys Val Thr Asn Asn Leu Gin Val Trp Asn 
5 10 15 2 0 

TGT TCT TGG AAA GCA CCC TCT GGA ACA GGC CGT GGT ACT GAT TAT GAA 415 
Cys Ser Trp Lys Ala Pre Ser Gly Thr Gly Ara Gly Thr Asp Tyr Glu 
25 30 35 

GTT TGC ATT GAA AAC AGG TCC CGT TCT TGT TAT CAG TTG GAG AAA ACC 4 66 

Val Cys lie Glu Asn Arg Ser Arg Ser Cys Tyr Gin Leu Glu Lys Thr 
40 45 50 

AGT ATT AAA ATT CCA GCT CTT TCA CAT GGT GAT TAT GAA ATA ACA ATA 514 
Ser lie Lys lie Pro Ala Leu Ser His Gly Asp Tyr Glu He Thr lie 

55 60 65 

AAT TCT CTA CAT GAT TTT GGA AGT TCT ACA AGT AAA TTC ACA CTA AAT 562 
Asn Ser Leu His Asp Phe Gly Ser Ser Thr Ser Lys Phe Thr Leu Asn 
7 0 7 5 8 0 

GAA CAA AAC GTT TCC TTA ATT CCA GAT ACT CCA GAG ATC TTG AAT TTG 610 
Glu Gin Asn Val Ser Leu He Pro Asp Thr Pro Glu He Leu Asn Leu 
85 90 95 1 00 

TCT GCT GAT TTC TCA ACC TCT ACA TTA TAC CTA AAG TGG AAC GAC AGG 65 6 

Ser Ala Asp Phe Ser Thr Ser Thr Leu Tyr Leu Lys Trp Asn Asp Arg 
105 110 115 

GGT TCA GTT TTT CCA CAC CGC TCA AAT GTT ATC TGG GAA ATT AAA GTT 7 06 

Gly Ser Val Phe Pro His Arg Ser Asn Val lie Trp Glu He Lys Val 
120 125 130 

CTA CGT AAA GAG AGT ATG GAG CTC GTA AAA TTA GTG ACC CAC AAC ACA 7 54 

Leu Arg Lys Glu Ser Met Glu Leu Val Lys Leu Val Thr His Asn Thr 
135 140 * 145 

ACT CTG AAT GGC AAA GAT ACA CTT CAT CAC TGG AGT TGG GCC TCA GAT 6 02 

Thr Leu Asn Gly Lys Asp Thr Leu His His Trp Ser Trp Ala Ser Asp 
150 155 * I6C 

ATG CCC TTG GAA TGT GCC ATT CAT TTT GTG GAA ATT AGA TCC TAC ATT ESC 
Met Pro Leu Glu Cys Ala He His ?ne Vai Glu He Arg Cys Tyr He 
165 ' 170 1~5 * "* ISC 

GAC AAT CTT CAT TTT TCT GGT CTC GAA GAG TGG A^T GAC TGG AGC CCT ~ ^ - 

Aso A sr. Leu His Phe Ser Glv Leu Glu Glu Trp Ser Asp Trp Ser ?r; 



WO 93/10151 ^± PCT/LS92/10272 



58 

GTG AAG AAC ATT TCT TGG ATA CCT GAT TCT CAG ACT AAG GTT TTT CCT 94 6 

Val Lys Asn lie Ser Trp lie Pro Asp Ser Gin Thr Lys Val Phe Pro 

200 205 210 

CAA GAT AAA GTG ATA CTT GTA GGC TCA GAC ATA ACA TTT TGT TGT GTG 994 

Gin Asp Lys Val lie Leu Val Gly Ser Asp lie Thr Phe Cys Cys Val 

215 220 225 

AGT CAA GAA AAA GTG TTA TCA GCA CTG ATT GGC CAT ACA AAC TGC CCC 104 2 

Ser Gin Glu Lys Val Leu Ser Ala Leu lie Gly His Thr Asn Cys Pro 

230 235 " 240 

TTG ATC CAT CTT GAT GGG GAA AAT GTT GCA ATC AAG ATT CGT AAT ATT 10 90 

Leu lie His Leu Asp Gly Glu Asn Val Ala lie Lys lie Arg Asn lie 
245 250 255 260 

TCT GTT TCT GCA AGT AGT GGA ACA AAT GTA GTT TTT ACA ACC . GAA GAT 1136 

Ser Val Ser Ala Ser Ser Gly Thr Asn Val Val Phe Thr Thr Glu Asp 
265 270 275 

AAC ATA TTT GGA ACC GTT ATT TTT GCT GGA TAT CCA CCA GAT ACT CCT 1156 

Asn lie Phe Gly Thr Val lie Phe Ala Gly Tyr Pro Pro Asp Thr Pro 
280 285 290 

CAA CAA CTG AAT TGT GAG ACA CAT GAT TTA AAA GAA ATT ATA TGT AGT 12 34 

Gin Gin Leu Asn Cys Glu Thr His Asp Leu Lys Glu lie lie Cys Ser 

295 300 " 305 

TGG AAT CCA GGA AGG GTG ACA GCG TTG GTG GGC CCA CGT GCT ACA AGC 12 82 

Trp Asn Pro Gly Arg Val Thr Ala Leu Val Gly Pro Arg Ala Thr Ser 

310 315 320 

TAC ACT TTA GTT GAA AGT TTT TCA GGA AAA TAT GTT AGA CTT AAA AGA 1330 

Tyr Thr Leu Val Glu Ser Phe Ser Gly Lys Tyr Val Arg Leu Lys Arg 
325 330 335 340 

GCT GAA GCA CCT ACA AAC GAA AGC TAT CAA TTA TTA TTT CAA ATG CTT 137 6 

Ala Glu Ala Pro Thr Asn Glu Ser Tyr Gin Leu Leu Phe Gin Met Leu 
345 350 355 

CCA AAT CAA GAA ATA TAT AAT TTT ACT TTG AAT GCT CAC AAT CCG CTG 142 6 

Pro Asn Gin Glu lie Tyr Asn Phe Thr Leu Asn Ala His Asn Pro Leu 
360 365 370 

GGT CGA TCA CAA TCA ACA ATT TTA GTT AAT ATA ACT GAA AAA GTT TAT 1474 

Gly Arg Ser Gin Ser Thr lie Leu Val Asn lie Thr Glu Lys Val Tyr 

375 360 385 

CCC CAT ACT CCT ACT TCA TTC AAA GTG AAG GAT ATT AAT TCA ACA GCT 1522 

Pro His Thr Pro Thr Ser Phe Lys Val Lys Asp lie Asn Ser Thr Ala 

390 395 " 400 

GTT AAA CTT TCT TGG CAT TTA CCA GGC AAC TTT GCA AAG ATT AAT TTT 1 57 C 

Val Lys Leu Ser Trp His Leu Pro Gly Asn Phe Ala Lys lie Asn Phe 
405 410 415 420 

TTA TGT GAA ATT GAA ATT AAG AAA TCT AAT TCA GTA CAA GAG CAG CGG 1 61 E 

Leu Cys Glu lie Glu lie Lys Lys Ser Asn Ser Val Gin Glu Gin Arg 
425 430 435 
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AAT GTC AC A ATC AAA GGA GTA GAA AAT TCA AGT TAT CTT GTT GCT CTG 1666 

Asn Val Thr He Lys Gly Val Glu Asn Ser Ser Tyr Leu Val Ala Leu 

44 0 ' 44 5 4 50 

GAC AAG TTA AAT CCA TAC ACT CTA TAT ACT TTT CGG ATT CGT TGT TCT 

Asp Lys Leu Asn Pro Tyr Thr Leu Tyr Thr Phe Arg lie Arg Cys Ser 
455 "* 460 465 

ACT GAA ACT TTC TGG AAA TGG AGC AAA TGG AGC AAT AAA AAA CAA CAT 17 62 

Thr Glu Thr Phe Trp Lys Trp Ser Lys Trp Ser Asn Lys Lys Gin His 
47C 475 48C 

TTA ACA ACA GAA GCC AGT CCT TCA AAG GGG CCT GAT ACT TGG AGA GAG 161C 

Leu Thr Thr Giu Aia Ser Pro Ser Lys Giy Pro Asp Thr Trp Arg Giu 

465 490 495 500 

TGG AGT TCT GAT GGA AAA AAT TTA ATA ATC TAT TGG AAG CCT TTA CCC 16 55 

Trp Ser Ser Asp Gly Lys Asn Leu lie lie Tyr Trp Lys Pro Leu Pre 

505 "* 51 C * 515 

ATT AAT GAA GCT AAT GGA AAA ATA CTT TCC TAC AAT GTA TCG TGT TCA 190 6 

He Asn Giu Ala Asn Gly Lys lie Leu Ser Tyr Asn Val Ser Cys Ser 

520 525 53C 

TCA GAT GAG GAA ACA CAG TCC CTT TCT GAA ATC CCT GAT CCT CAG CAC 1954 

Ser Asp Glu Glu Thr Gin Ser Leu Ser Glu He Pro Asp Pro Gin His 
535 540 545 

AAA GCA GAG ATA CGA CTT GAT AAG AAT GAC TAC ATC ATC AGC GTA GTG 2 0 02 

Lys Ala Giu He Arg Leu Asp Lys Asn Asp Tyr He He Ser Val Val 
550 555 * 560 

GCT AAA AAT TCT GTG GGC TCA TCA CCA CCT TCC AAA ATA GCG AGT ATG 2 05 0 

Ala Lvs Asn Ser Val Gly Ser Ser Pro Pro Ser Lys lie Aia Ser Met 

565 "* 570 575 5 8 0 

GAA ATT CCA AAT GAT GAT CTC AAA ATA GAA CAA GTT GTT GGG ATG GGA 2 09E 

Glu He Pro Asn Asp Asp Leu Lys He Glu Gin Val Val Giy Met Gly 

585 590 595 

AAG GGG ATT CTC CTC ACC TGG CAT TAC GAC CCC AAC ATG ACT TGC GAC 214 c 

Lys Giy He Leu Leu Thr Trp His Tyr Asp Pro Asn Met Thr Cys Asp 

600 605 610 

TAC GTC ATT AAG TGG TGT AAC TCG TCT CGG TCG GAA CCA TGC CTT ATG 2194 

Tyr Val He Lys Trp Cys Asn Ser Ser Arg Ser Giu Pro Cys Leu Met 
615 " 620 625 

GAC TGG AGA AAA GTT CCC TCA AAC AGC ACT GAA ACT GTA ATA GAA TCT 22 4 2 

Asp Tro Arg Lys Val Pro Ser Asn Ser Thr Giu Thr Val He Glu Ser 
630 635 640 

GAT GAG TTT CGA CCA G3T ATA AGA TAT AAT TTT TTC CTG TAT GGA TGC 229 1 

Asd Glu Phe Arc Pre Glv He Arg Tvr Asn Phe Pr.e Leu Tyr Gly Cys 

645 " 65C ' * 655 66: 

AGA AAT CAA GGA TAT CAA TTA TTA CGC TCC ATG ATT GGA TAT ATA GAA 2 5 3 E 

Ara Asn Gin Glv Tvr Gin Leu Leu Arg Ser Met He Gly Tyr He 31- 
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GAA TTG GCT CCC ATT GTT GCA CCA AAT TTT ACT GTT GAG GAT ACT TCT 2386 
Glu Leu Ala Pro lie Val Ala Pro Asn Phe Thr Val Glu Asp Thr Ser 
680 685 690 

GCA GAT TCG ATA TTA GTA AAA TGG GAA GAC ATT CCT GTG GAA GAA CTT 2 4 34 

Ala Asp Ser lie Leu Val Lys Trp Glu Asp lie Pro Val Glu Glu Leu 
695 700 705 

AGA GGC TTT TTA AGA GGA TAT TTG TTT TAC TTT GGA AAA GGA GAA AGA 2 4 62 

Arg Gly Phe Leu Arg Gly Tyr Leu Phe Tyr Phe Gly Lys Gly Glu Arg 
710 715 720 

GAC ACA TCT AAG ATG AGG GTT TTA GAA TCA GGT CGT TCT GAC ATA AAA 2 53C 

Asp Thr Ser Lys Met Arg Val Leu Glu Ser Gly Arg Ser Asp lie Lys 
725 730 735 740 

GTT AAG AAT ATT ACT GAC ATA TCC CAG AAG ACA CTG AGA ATT GCT GAT 2 57 6 

Val Lys Asn lie Thr Asp lie Ser Gin Lys Thr Leu Arg lie Ala Asp 
745 750 755 

CTT CAA GGT AAA ACA AGT TAC CAC CTG GTC TTG CGA GCC TAT ACA GAT 2 62 6 

Leu Gin Gly Lys Thr Ser Tyr His Leu Val Leu Arg Ala Tyr Thr Asp 
760 ' 765 770 

GGT GGA GTG GGC CCG GAG AAG AGT ATG TAT GTG GTG ACA AAG GAA AAT 2 67 4 

Gly Gly Val Gly Pro Glu Lys Ser Met Tyr Val Val Thr Lys Glu Asn 
775 780 785 

TCT GTG GGA TTA ATT ATT GCC ATT CTC ATC CCA GTG GCA GTG GCT GTC 2722 
Ser Val Gly Leu lie lie Ala lie Leu He Pro Val Ala Val Ala Val 
790 795 800 

ATT GTT GGA GTG GTG ACA AGT ATC CTT TGC TAT CGG AAA CGA GAA TGG 2 77 0 

He Val Gly Val Val Thr Ser He Leu Cys Tyr Arg Lys Arg Glu Trp 
805 810 815 820 

ATT AAA GAA ACC TTC TAC CCT GAT ATT CCA AAT CCA GAA AAC TGT AAA 2818 
He Lys Glu Thr Phe Tyr Pro Asp He Pro Asn Pro Glu Asn Cys Lys 
825 830 835 

GCA TTA CAG TTT CAA AAG AGT GTC TGT GAG GGA AGC AGT GCT CTT AAA 2 8 66 

Ala Leu Gin Phe Gin Lys Ser Val Cys Glu Gly Ser Ser Ala Leu Lys 
840 845 850 

ACA TTG GAA ATG AAT CCT TGT ACC CCA AAT AAT GTT GAG GTT CTG GAA 2 914 

Thr Leu Glu Met Asn Pro Cys Thr Pro Asn Asn Val Glu Val Leu Glu 
855 860 865 

ACT CGA TCA GCA TTT CCT AAA ATA GAA GAT ACA GAA ATA ATT TCC CCA 2 9 62 

Thr Arg Ser Ala Phe Pro Lys lie Glu Asp Thr Glu He He Ser Pro 
870 875 880 

GTA GCT GAG CGT CCT GAA GAT CGC TCT GAT GCA GAG CCT GAA AAC CAT 301 I 

Val Ala Glu Arg Pro Glu Asp Arg Ser Asp Ala Glu Pro Glu Asn His 
885 890 895 90C 

GTG GTT GTG TCC TAT TGT CCA CCC ATC ATT GAG GAA GAA ATA CCA AAC 30 5: 

Val Val Val Ser Tyr Cys Pre Pro lie He Glu Glu Glu lie Pre Asn 
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CCA GCC GCA GAT GAA GOT GGA GGG ACT \jwA CAG GTT A.TT TAC ATT GAT 
Pro Ala Ala Asp Glu Ala Gly Gly Tr.r Ala Glr. Val lie Tyr He Asp 



92 0 



GTT CAG TCG ATG TAT CAG CCT CAA GCA AAA CCA GAA GAA AAA AAA AAA 3154 

Val Gin Ser Met Tyr Glr. Pro Gin Ala Lys Pre Glu Glu Lys Lys Lys 

935 ' 940 945 

AAA AGC AGG TCG TCT CGT TCC AAG ATC T 51 El 

Lys Ser Arg Ser Ser Arg Ser Lys He 

950 955 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 002 air.ino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : £ : 

Met Met Asp He Tyr Val Cys Leu Lys Arg Pro Ser Trp Met Val Asp 
-44 * -40 * " -35 -30 

Asn Lys Arg Met Arg Thr Ala Ser Asn Phe Gin Trp Leu Leu Ser Thr 
-25 -20 -15 

Phe He Leu Leu Tyr Leu Met Asn Gin Val Asn Ser Gin Lys Lys Gly 
-10 -5 1 

Ala Pro His Asp Leu Lys Cys Val Thr Asn Asn Leu Gin Val Trp Asn 
5 10 15 20 

Cys Ser Trp Lys Ala Pro Ser Gly Thr Gly Arg Gly Thr Asp Tyr Glu 
25 30 35 

Val Cys lie Glu Asn Arg Ser Arg Ser Cys Tyr Gin Leu Glu Lys Thr 
40 45 50 

Ser He Lvs lie Pro Ala Leu Ser His Gly Asp Tyr Glu He Thr lie 
55 60 65 

Asn Ser Leu His Asp Phe Gly Ser Ser Thr Ser Lys Phe Thr Leu Asn 
70 75 SO 

Glu Gin Asn Val Ser Leu lie Pro Asp Thr Pre Glu lie Leu Asn Leu 
85 90 95 ICC 

Ser Ala Asp Pne Ser Thr Ser Thr Leu Tyr Leu Lys Trp Asn Asp Arg 
105 110 115 

Gly Ser Val Phe Pre His Arg Ser Asn Val He Trp Glu lie Lys Val 
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Thr Leu Asn Gly Lys Asp Thr Leu His His Trp Ser Trp Ala Ser Asp 
150 155 16C 

Met Pro Leu Glu Cys Ala He His Phe Val Glu He Arg Cys Tyr He 
165 170 175 18C 

Asp Asn Leu His Phe Ser Gly Leu Glu Glu Trp Ser Asp Trp Ser Pre 
185 190 195 

Val Lys Asn He Ser Trp He Pro Asp Ser Gin Thr Lys Val Phe Pre 
200 * 205 210 

Gin Asp Lys Val He Leu Val Gly Ser Asp He Thr Phe Cys Cys Val 
215 220 225 

Ser Gin Glu Lys Val Leu Ser Ala Leu He Gly His Thr Asn Cys Pro 
230 235 240 

Leu He His Leu Asp Gly Glu Asn Val Ala He Lys He Arg Asn He 
245 250 255 260 

Ser Val Ser Ala Ser Ser Gly Thr Asn Val Val Phe Tnr Thr Glu Asp 
265 "* 270 275 

Asn lie Phe Gly Thr Val He Phe Ala Gly Tyr Pro Pro Asp Thr Pro 
280 285 290 

Gin Gin Leu Asn Cys Glu Thr His Asp Leu Lys Glu He lie Cys Ser 
295 300 305 

Trp Asn Pro Gly Arg Val Thr Ala Leu Val Gly Pro Arg Ala Thr Ser 
310 315 320 

Tyr Thr Leu Val Glu Ser Phe Ser Gly Lys Tyr Val Arg Leu Lys Arg 
325 330 335 340 

Ala Glu Ala Pro Thr Asn Glu Ser Tyr Gin Leu Leu Phe Gin Met Leu 
345 350 355 

Pro Asn Gin Glu He Tyr Asn Phe Thr Leu Asn Ala His Asn Pro Leu 
360 365 370 

Gly Arg Ser Gin Ser Thr He Leu Val Asn He Thr Glu Lys Val Tyr 
375 380 385 

Pro His Thr Pro Thr Ser Phe Lys Val Lys Asp lie Asn Ser Thr Ala 
390 395 400 

Val Lys Leu Ser Trp His Leu Pro Gly Asn Phe Ala Lys lie Asn Phe 
405* 41 C 415 4 2 C 

Leu Cys Glu lie Glu lie Lys Lys Ser Asn Ser Val Gin Glu Gin Arg 



425 



430 



435 



Asn Val Thr lie Lys Gly Val Glu Asn Ser Ser Tyr Leu Val Ala Leu 
440 445 450 
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Thr GIu Thr Pne Trp Lys Trp Ser Lys Trp Ser A sr. Lys Lys 31r. His 
470 ' * 4^5 4S0 

Leu Thr Thr GIu Ala Ser Pre Ser Lvs Gly Pre Asp Thr Trp Arg Glu 
485 49C 495 500 

Trp Ser Ser Asc Glv Lvs Asn Leu He He Tyr Trp Lys Pre Leu Pre 
505 " 510 515 

He Asn GIu Ala Asn Glv Lvs He Leu Ser Tyr Asn Val Ser Cys Ser 
52C 525 530 

Ser Asp Glu Glu Thr Gin Ser Leu Ser Glu He Pro Asp Pro Gin His 
535 540 545 

Lys Ala Glu He Arg Leu Asp Lys Asn Asp Tyr He He Ser Val Val 

550 555 560 

Ala Lys Asn Ser Val Gly Ser Ser Pro Pro Ser Lys He Ala Ser Met 
565 570 575 580 

Glu He Pro Asn Aso Asp Leu Lys He Glu Gin Val Val Gly Met Gly 
5 8*5 590 595 

Lys Gly lie Leu Leu Thr Trp His Tyr Asp Pro Asn Met Thr Cys Asp 
600 ' 605 610 

Tyr Val lie Lys Trp Cys Asn Ser Ser Arg Ser Glu Pro Cys Leu Met 
615 " 620 625 

Asp Trp Arg Lys Val Pro Ser Asn Ser Thr Glu Thr Val lie GIu Ser 
630 635 64 0 

hsv Glu Phe Arg Pro Gly He Arg Tyr Asn Phe Phe Leu Tyr Gly Cys 
645 650 655 660 

Arg Asn Gin Gly Tyr Gin Leu Leu Arg Ser Met He Gly Tyr lie Glu 
665 670 675 

Glu Leu Ala Pro lie Val Ala Pro Asn Phe Thr Val Glu Asp Thr Ser 
680 685 690 

Ala Asp Ser lie Leu Val Lys Trp Glu Asp He Pro Val Glu Glu Leu 
695 700 705 

Arg Gly Phe Leu Arg Gly Tyr Leu Phe Tyr Phe Gly Lys Gly Glu Arg 
710 715 720 

Asd Thr Ser Lvs Met Arc Val Leu Glu Ser Gly Arg Ser Asp He Lys 
725 ' 730 735 74C 

Val Lvs Asn He Thr Asc He Ser Glr. Lys Tr.r Leu Arg He Ala Asp 
74 5 750 ^5 5 

Leu Gin Gly Lys Thr Ser Tyr His Leu Val Leu Arc Ala Tyr Tr.r Asp 

761 "65 7~ : 
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Ser Val Gly Leu He He Aid He Leu lie Pro Val Ala Val Ala Val 
790 795 80C 

He Val Gly Val Val Thr Ser He Leu Cys Tyr Arg Lys Arg Glu Trp 
805 810 815 820 

He Lys Glu Thr Phe Tyr Pro Asp He Pro Asn Pro Glu Asn Cys Lys 
825 830 835 

Ala Leu Gin Phe Gin Lys Ser Val Cys Glu Gly Ser Ser Ala Leu Lys 
840 845 850 

Thr Leu Glu Met Asn Pro Cys Thr Pro Asn Asn Val Glu Val Leu Glu 
855 860 865 

Thr Arg Ser Ala Phe Pro Lys lie Glu Asp Thr Glu He lie Ser Pro 
870 875 880 

Val Ala Glu Arg Pro Glu Asp Arg Ser Asp Ala Glu Pre Glu Asn His 
885 890 895 900 

Val Val Val Ser Tyr Cys Pro Pro lie lie Glu Glu Glu He Pro Asn 
905 910 915 

Pro Ala Ala Asp Glu Ala Gly Gly Thr Ala Gin Val lie Tyr lie Asp 
920 925 930 

Val Gin Ser Met Tyr Gin Pro Gin Ala Lys Pro Glu Glu Lys Lys Lys 
935 940 945 

Lys Ser Arg Ser Ser Arg Ser Lys He 
950 955 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCCGGTGG AGGTGGTTCT GGTGGAGGTG GTTCAGGTGG TGGAGGATCA GGAGGTGGTG 
GATCAGGTGG AGGAGGTTCT GGAGGTGGAG GTTCCGGAAT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ 1 0 NO 1 : E : 
GATATGTCGA CGATGATGGA TATTTACGTA TGTTT3 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GCATGGATCC ACCTCCTCCA GAATTTTCCT TTGTCACCAC ATACATAC 
(2) INFORMATION FOR SEQ ID NO : 1 C : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1C: 

CGCGTCCGGA GGAGGTGGAT CTGAACTTCT AGATCCATGT GGTTATATC 

(2) INFORMATION FOR SEQ ID NO: 11: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: II : 
GCATGCGGCC 333TATTCAA TTTCTCCT73 A3 3 AAA C 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATATGTCGA CAAGATGTTG ACGTTGCAGA CTTGG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 : 
GCATGGATCC ACCTCCTCCT TCAATTTCTC CTTGAGCAAA C 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 : 

CGCGTCCGGA GGAGGTGGTA GCCAGAAAAA GGGGGCTCCT CATG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 3 base pairs 
£B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 : 
GCATGCGGCC GCTAAGAATT TTCCTTTGTC ACCACATACA TAC 
(2) INFORMATI ON FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 
< D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 € : 
GCATAGATCT GGGCTCAGAA TTTTCCTTTG TCACCACATA CATAC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 7 : 
GCATAGATCT GGGCTCTTCA ATTTCTCCTT GAG C AAA C 
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What is claimed is: 

5 

1 . A receptor capable of binding oncostatin M and leukemia inhibitory 
factor, comprising gpl30 covalently linked to LIF-R. 

2. A receptor according to claim 1, wherein said receptor comprises a 
10 soluble gpl30 polypeptide covalently linked to a soluble LIF-R polypeptide. 

3. A receptor according to claim 1 wherein said receptor comprises gpl30 
covalently linked to LEF-R via a polypeptide linker. 

15 4. A receptor according to claim 3, wherein said receptor is a recombinant 

fusion protein of the formula: 



R1-L-R2 orR2-L-Ri 



20 wherein Ri represents gpl30; R2 represents LIF-R; and L represents a polypeptide 
linker. 



5. A receptor according to claim 4 wherein the polypeptide linker 
comprises from 20 to 100 amino acids selected from the group consisting of glycine, 

25 asparagine, serine, threonine, and alanine. 

6. A receptor according to claim 5 wherein the polypeptide linker 
comprises an amino acid sequence selected from the group consisting of : 
(Gly4-Ser-Gly5-Ser)2 and (Gly4-Ser) n , wherein n is 4-12. 

30 

7 . An isolated DNA sequence encoding the receptor of claim 4. 

8. A recombinant expression vector comprising the DNA sequence of 
claim 7. 

35 

9. A host cell containing the expression vector of claim 8. 
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10. A receptor according to claim 3, comprising a first fusion polypeptide 
that comprises an antibody Fc region polypeptide attached to the C-terminus of gpl30. 
and a second fusion polypeptide that comprises an antibody Fc region polypeptide 
attached to the C-terminus of LEF-R, wherein said first fusion polypeptide is linked to 

5 said second fusion polypeptide via disulfide bonds between the Fc region polypeptides. 

11. A receptor according to claim 1, 4, or 10, wherein: 

a) said gpl30 is encoded by an isolated DNA selected from the group 
10 consisting of a first DNA sequence comprising nucleotides 244-2369 of SEQ ID NO:l, 
a second DNA sequence comprising nucleotides 310-2369 of SEQ ID NO:l, and a 
third DNA sequence that will hybridize to said second DNA sequence under moderately 
stringent conditions; and 

15 b) said LIF-R is encoded by an isolated DNA selected from the group 

consisting of a first DNA sequence comprising nucleotides 179-3182 of SEQ ID NO:5, 
a second DNA sequence comprising nucleotides 31 1-3182 of SEQ ID NO:5, and a 
third DNA sequence that will hybridize to said second DNA sequence under moderately 
stringent conditions. 

20 

12. A receptor according to claim 10 wherein said gp!30 is a soluble gpl30 
polypeptide and said LEF-R is a soluble LIF-R polypeptide. 

13. A fusion protein comprising an antibody Fc region polypeptide attached 
25 to the C-terminus of a soluble gpl 30 polypeptide. 

14. An isolated DNA sequence encoding a fusion protein according to claim 

13. 

30 15. A fusion protein comprising an antibody Fc region polypeptide attached 

to the C-terminus of a soluble LIF-R polypeptide. 

1 6. An isolated DNA sequence encoding a fusion protein according to claim 

15. 

35 

17. A homodimenc receptor comprising two fusion proteins according to 
claim 15. linked via disulfide bonds between the Fc region polypeptides 
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18. A process for preparing a receptor according to claim 4, comprising 
culturing a host cell transformed with an expression vector comprising a DNA sequence 
that encodes said fusion protein under conditions that promote expression of said 

5 fusion protein, and recovering said fusion protein. 

19. A process for preparing a receptor according to claim 10, comprising 
culturing a host cell co-transfected with a first expression vector encoding said first 
fusion polypeptide and with a second expression vector encoding said second fusion 

10 polypeptide under conditions that promote expression of said first and second fusion 
polypeptides, and recovering said receptor. 



15 



20. A pharmaceutical composition for treating a disorder mediated by 
oncostatin M or LIF, comprising the receptor of claim 1, 4, 10, or 12, and a suitable 
diluent or carrier. 
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